Search Features: Relevancy Ranking

dtSearch can sort and instantly re-sort search results by relevancy, number of hits, file name, file date, etc.

Natural language algorithms provide automatic term weighting, following a "plain English" or unstructured indexed search request.

Automatic term weighting is based on the frequency and density of hits in your files.

For example, in the search request "get me Sam's memo on the 1999 CorpX takeover", if 1999 appeared in 3,000 files, and Sam appeared in only two files, then Sam would get a much higher relevancy rating, taking you straight to the most "relevant" files.
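
The effect in the example above can be illustrated with an inverse-document-frequency style weight, where a term found in fewer files counts for more. This is only a sketch of the general idea: the actual dtSearch formula is not described here, and the function name `rarity_weight`, the logarithmic form, and the collection size of 10,000 files are all assumptions made for illustration.

```python
import math


def rarity_weight(files_with_term: int, total_files: int) -> float:
    """Hypothetical weight: the fewer files contain a term, the higher it scores."""
    return math.log(total_files / files_with_term)


# Assumed collection size; the example only gives the per-term file counts.
TOTAL_FILES = 10_000
file_counts = {"1999": 3_000, "sam": 2}

weights = {term: rarity_weight(n, TOTAL_FILES) for term, n in file_counts.items()}

# List the terms from most to least heavily weighted.
for term, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(f"{term}: {weight:.2f}")
```

Under these assumptions "sam" ends up weighted far more heavily than "1999", which matches the ranking behavior described in the example.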

A positional scoring option works with dtSearch's natural language relevancy ranking to rank documents more highly when hits are near the top of a file, or otherwise clustered in a file.
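
One way to picture positional scoring is a bonus that grows when the first hit is early and when the hits sit close together. The sketch below is an assumption-laden illustration, not dtSearch's implementation: `positional_score`, its two components, and the equal weighting between them are all hypothetical.

```python
from typing import Sequence


def positional_score(hit_positions: Sequence[int], doc_length: int) -> float:
    """Hypothetical bonus combining earliness and clustering of hits.

    hit_positions are 0-based word offsets within a document of doc_length words.
    """
    if not hit_positions or doc_length <= 0:
        return 0.0
    hits = sorted(hit_positions)
    # Earliness: 1.0 when the first hit is the first word, falling toward 0.0.
    earliness = 1.0 - hits[0] / doc_length
    # Clustering: close to 1.0 when all hits fall within a small span of words.
    spread = hits[-1] - hits[0]
    clustering = 1.0 - spread / doc_length
    return earliness + clustering


# Hits near the top and close together vs. hits scattered late in the file.
top_clustered = positional_score([3, 5, 8], doc_length=1000)
scattered = positional_score([400, 700, 950], doc_length=1000)
print(top_clustered > scattered)
```

In a real ranking, a bonus like this would presumably be combined with the term weights rather than used on its own.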

dtSearch also includes variable term weighting options for both indexed and unindexed searches:

Positive term weighting can place extra emphasis on one or more words: soup:8 or recipe:3

Negative term weighting can assign negative emphasis to one or more words: red or green or yellow:-7

Variable term weighting can also apply to fields: (description:5 contains (apple and pear)) or (author:2 contains smith)
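
The word-level examples above can be read as a term followed by an optional `:weight` suffix. The sketch below parses only the flat "or" form and scores a text by summing weight times occurrence count. It is not dtSearch's parser or scoring method: the function names, the default weight of 1 for unweighted terms, and the scoring rule are assumptions, and the field syntax is deliberately left out.

```python
import re
from collections import Counter
from typing import Dict


def parse_weighted_terms(query: str) -> Dict[str, int]:
    """Parse a flat 'a:8 or b or c:-7' query into {term: weight}.

    Terms without an explicit weight get 1 (an assumed default).
    """
    weights: Dict[str, int] = {}
    for part in re.split(r"\s+or\s+", query.strip(), flags=re.IGNORECASE):
        term, _, weight = part.partition(":")
        weights[term.lower()] = int(weight) if weight else 1
    return weights


def score(text: str, weights: Dict[str, int]) -> int:
    """Sum weight * occurrence count for each query term found in the text."""
    counts = Counter(re.findall(r"\w+", text.lower()))
    return sum(w * counts[t] for t, w in weights.items())


positive = parse_weighted_terms("soup:8 or recipe:3")
negative = parse_weighted_terms("red or green or yellow:-7")
print(positive, negative)
print(score("Soup recipe: tomato soup", positive))
```

With this scoring rule, a negative weight such as yellow:-7 lowers the score of any text that mentions the term, pushing it down the result list.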

The dtSearch product line can instantly search terabytes of text across a desktop, network, Internet or Intranet site.

dtSearch products also serve as tools for publishing large document collections, with instant text searching, to Web sites or CD/DVDs.

over two dozen indexed, unindexed, fielded and full-text search options

highlights hits in HTML, XML and PDF, while displaying embedded links, formatting and images

converts other file types (word processor, database, spreadsheet, email and full-text of email attachments, ZIP, Unicode, etc.) to HTML for display with highlighted hits

built-in Spider adds a third-party or other Web site (public, secure content, password accessible, etc.) to your searchable database

Spider supports Web-based content (HTML, PDF, XML, etc.) as well as dynamically-generated content (ASP.NET, MS CMS, SharePoint, etc.)

General supported file types

SQL and similar data sources