dtSearch Beta Improves PDF Search Highlighting, Removes Plug-in Want


dtSearch has introduced a model 2026.01 beta that simplifies how customers see highlighted search ends in PDF recordsdata. The brand new launch eliminates the necessity for a separate PDF highlighter plug-in, a change that applies to dtSearch enterprise and developer merchandise, together with SDKs for Home windows, Linux, and macOS. These merchandise search terabytes of blended on-line and offline knowledge immediately, operating on premises or within the cloud, similar to on Azure or AWS.

The principle function of the brand new model is improved PDF hit highlighting. The brand new course of highlights search hits by including annotations on to the PDF file. This implies PDF recordsdata now work like different supported knowledge sorts—similar to Microsoft Workplace recordsdata and emails with attachments—displaying recordsdata with multicolor hit highlighting for any variety of concurrent customers.

dtSearch proprietor David Thede advised SD Instances in an interview that the previous strategy of utilizing an Adobe Acrobat Reader plug-in turned more and more untenable in a browser atmosphere. The brand new technique gives a a lot cleaner means for individuals so as to add PDF highlighting of their purposes. Thede defined how the system modified: “The important thing to getting that work is that we wanted to have the ability to add the highlights as annotations within the pdf file, so relatively than producing html from pdf, we take an current pdf and we stick the annotations on it, after which serve that.” 

Within the new model, dtSearch has a approach to work with browsers that use the open-source pdf.js mission, Thede mentioned. The Firefox browser, like many browsers, have JavaScript-based PDF viewers based mostly on that mission. “So, in our dtSearch desktop product we will embed a viewer window that has pdf.js used to show the pdf file.  We will do the hit navigation and the hit highlighting on high of that, however we will additionally do it in our web-based merchandise.”

dtSearch merchandise embrace a Terabyte Indexer that may index a terabyte of textual content throughout many sources, together with emails with nested attachments and on-line knowledge. Listed search is usually instantaneous, even when protecting terabytes of information with concurrent customers. The product line affords over 25 search options, together with full-text and metadata choices. It helps Unicode for lots of of worldwide languages and affords forensics-oriented choices. SDKs can be found for C++, Java, and .NET APIs, they usually assist databases like SQL and NoSQL.

Thede burdened the worth of the brand new PDF function. He mentioned, “Having the ability to spotlight hits in PDF recordsdata after a search is a really good factor to have the ability to do, as a result of PDF is so extensively used”. He famous that it is a large time saver for professionals, similar to attorneys reviewing lengthy paperwork1

Concerning AI integration, Thede confirmed that dtSearch doesn’t embrace AI in its merchandise. He famous this determination is tied to buyer safety considerations: “Our clients are usually establishments which can be extraordinarily involved about confidentiality”. Nevertheless, Thede added that dtSearch plans to have a look at methods to present customers the instruments to attach their search outcomes with AI after they select to take action.

 

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles