| |
|
|
 |
|
Language
Extension Pack extends
dtSearch's built-in
Unicode international
language support to add
customized noise word
list and stemming rules
for over 25 European
languages.
The dtSearch product line
includes Unicode support,
allowing indexing and searching
of the many hundreds of
languages supported by the
Unicode standard. Supplementing
dtSearch's built-in Unicode
support, noise word lists and
stemming rules (to find
different linguistic variations
on the same root word) for over
two dozen European languages.
From a sample Cyrillic white
paper authored by dtSearch's UK
distributor: dtSearch
"includes a mapping from the
Cyrillic 'i' to the Latin 'i'
and thus if you search on web
pages spidered by dtSearch you
would find all the web pages,
irrespective of whether you
have a Ukrainian keyboard or a
Russian keyboard and make the
error of substituting the Latin
'i'. It is this depth of
experience that distinguishes
dtSearch from many of the newer
entrants to the world of search
technology."
More
from Cyrillic white
paper;
More on Language
Extension Pack
|
|
 |
|
Encyclopaedia
Britannica’s
cross-language
morphological search plug
in integrates with
dtSearch.
With a focus on Arabic, Farsi
and other Middle Eastern
Languages, Encyclopaedia
Britannica has developed a rich
product suite, which allows
English speaking users to
review and analyze foreign
language source data.
Components of the product suite
include Britannica’s Cross
Language Morphological Analysis
(BMA), Cross Language Entity
Extraction (EntX), and Embedded
Translation Layer (ETL).
“We are delighted to
partner with dtSearch and
provide state of the art
foreign language solutions for
our customers. Britannica’s
morphology suite seamlessly
integrates with the dtSearch
Engine developer APIs, enabling
users to use English language
queries to search for foreign
languages, overcoming
morphological complexity and
ambiguity. All methods enable
smooth and transparent
integration, adding
Britannica’s language
capabilities while maintaining
the full range of dtSearch’s
flexible search capacity.”
More
|
|
 |
|
Basis
Technology’s
Rosette®
Linguistics Platform
integration now
accessible through
dtSearch API.
The Rosette Linguistics
Platform helps applications
unlock the meaning of
unstructured text by
determining the language, and
identifying the basic
linguistic features and
structure. Relying on code that
is unique to each particular
language, the analyzers result
in a highly accurate analysis.
The resulting API integration
provides morphological analysis
for enhanced Chinese, Japanese
and Korean language support.
“We're pleased to be working
with dtSearch to provide their
customers with solutions for
enabling multilingual
information
processing.” More
|
|
 |
|
Quicktionary
Engine expands concept
searching in dtSearch
into mulltilingual
dimensions.
With Ligature's Quicktionary
Engine, a search for an English
word can automatically retrieve
its foreign language
equivalents— or vice versa.
“Quicktionary Engine takes
the power of concept searching
in dtSearch, and extends that
automatically into multilingual
dimensions. Quicktionary Engine
translates – with lightning
speed – your search term into
other languages, and then
submits the results along with
the original terms into the
dtSearch Engine. The result is
super–powered international
language search.”
More
|
|
 |
|
Content
Analyst™ Technology
platform adds advanced
content analysis to
dtSearching.
Content Analyst is now offering
a new Content Analyst
Technology platform, combining
Content Analyst's specialized
categorization and semantic
analytics with the dtSearch
Engine's text and meta data
searching. The platform
provides extensive OEM
customization options for its
users. “We've combined
semantic analytics and multiple
language processing techniques
with dtSearch's own keyword,
meta data, Boolean, fuzzy and
other text search capabilities.
The result is a major leap
forward in information access
technologies.”
More
|
|
 |
|
Sovren teams
with dtSearch for
comprehensive recruitment
developer component
suite.
A software-components-only
firm, Sovren develops and
markets a full suite of
developer components for the
recruitment market. Sovren’s
solutions are multilingual and
are used in job boards,
assessment companies, applicant
tracking systems, corporate HR,
recruitment firms, research
firms, and HRIS/HCM systems
worldwide. “The dtSearch
Engine is fast, effective and a
perfect fit for the recruitment
industry. Building on top of
the dtSearch Engine APIs, we
have added advanced text
analysis specifically geared
for the recruitment
market.”
More
|
|
 |
|
Pinpoint
Labs’ SafeCopy 2
integrates with dtSearch
products to provide
forensically-sound
electronically stored
information (ESI)
collection for dtSearch
users.
Pinpoint Labs specializes in
computer forensics software and
services. The company’s
SafeCopy 2 integrates with
dtSearch for ESI chain of
custody handling of retrieved
files. dtSearch provides “a
great way to do text searches”
and “can be used effectively to
create file lists that can be
then used by SafeCopy to
collect those files … a lot of
programs already have dtSearch
integrated into them.” For
details on forensically-sound
ESI file collection through
SafeCopy for dtSearch users,
please visit Pinpoint’s
Webinar
presentation.
|
|
 |
|
Bitext
integrates dtSearch with
NaturalFinder linguistic
analysis
suite. Bitext
develops linguistic
technologies for natural
language understanding in
different languages,
including Spanish and
English. Its
NaturalFinder product
suite integrates with the
dtSearch Engine to
provide enhanced
linguistic analysis in
web-based and other
environments.
NaturalFinder also
includes DataNet, which
expedites semantic
relations management, and
DataSpell, which detects
spelling and typographic
errors and suggests the
correct search query.
“We liked the quality
of dtSearch's
documentation and code
samples. The readiness of
its technical support
service made integration
a simple and low-cost
task.”
More
|
|
|
|
|
The dtSearch product
line can instantly search terabytes of
text across a desktop, network,
Internet or Intranet
site.
|
|
dtSearch products
also serve as tools for publishing,
with instant text searching, large
document collections to Web sites or
CD/DVDs.
|
 |
over two dozen indexed, unindexed,
fielded and full-text search
options |
 |
highlights
hits in HTML, XML and PDF, while
displaying embedded links, formatting and
images |
 |
converts other file types — word
processor, database, spreadsheet, email and
full-text of email attachments, ZIP, Unicode,
etc. — to HTML for display with highlighted
hits |
 |
built-in Spider adds a third-party
or other Web site (public, secure content,
password accessible, etc.) to your searchable
database |
 |
Spider supports Web-based
content (HTML, PDF, XML, etc.) as well as
dynamically-generated content (ASP.NET, MS CMS,
SharePoint, etc.) |
| General supported file
types |
| SQL and similar data
sources |
|