
dtSearch has introduced a model 2026.01 beta that simplifies how customers see highlighted search leads to PDF information. The brand new launch eliminates the necessity for a separate PDF highlighter plug-in, a change that applies to dtSearch enterprise and developer merchandise, together with SDKs for Home windows, Linux, and macOS. These merchandise search terabytes of blended on-line and offline information immediately, operating on premises or within the cloud, corresponding to on Azure or AWS.
The primary function of the brand new model is improved PDF hit highlighting. The brand new course of highlights search hits by including annotations on to the PDF file. This implies PDF information now work like different supported information sorts—corresponding to Microsoft Workplace information and emails with attachments—displaying information with multicolor hit highlighting for any variety of concurrent customers.
dtSearch proprietor David Thede advised SD Occasions in an interview that the previous method of utilizing an Adobe Acrobat Reader plug-in grew to become more and more untenable in a browser setting. The brand new technique gives a a lot cleaner means for individuals so as to add PDF highlighting of their functions. Thede defined how the system modified: “The important thing to getting that work is that we wanted to have the ability to add the highlights as annotations within the pdf file, so slightly than producing html from pdf, we take an current pdf and we stick the annotations on it, after which serve that.”
Within the new model, dtSearch has a approach to work with browsers that use the open-source pdf.js mission, Thede mentioned. The Firefox browser, like many browsers, have JavaScript-based PDF viewers primarily based on that mission. “So, in our dtSearch desktop product we are able to embed a viewer window that has pdf.js used to show the pdf file. We are able to do the hit navigation and the hit highlighting on prime of that, however we are able to additionally do it in our web-based merchandise.”
dtSearch merchandise embrace a Terabyte Indexer that may index a terabyte of textual content throughout many sources, together with emails with nested attachments and on-line information. Listed search is usually instantaneous, even when masking terabytes of information with concurrent customers. The product line presents over 25 search options, together with full-text and metadata choices. It helps Unicode for lots of of worldwide languages and presents forensics-oriented choices. SDKs can be found for C++, Java, and .NET APIs, and so they help databases like SQL and NoSQL.
Thede careworn the worth of the brand new PDF function. He mentioned, “With the ability to spotlight hits in PDF information after a search is a really good factor to have the ability to do, as a result of PDF is so extensively used”. He famous that it is a large time saver for professionals, corresponding to attorneys reviewing lengthy paperwork1
Relating to AI integration, Thede confirmed that dtSearch doesn’t embrace AI in its merchandise. He famous this determination is tied to buyer safety issues: “Our clients are typically establishments which are extraordinarily involved about confidentiality”. Nevertheless, Thede added that dtSearch plans to have a look at methods to provide customers the instruments to attach their search outcomes with AI after they select to take action.