Search Technology: SasaTAC - Crawler Management System
A typical crawler is a program or automated script that browses the World Wide Web in a methodical, automated manner. It is designed to capture web content as generally as possible rather than to target specific items. Such crawlers are mainly used to create an indexed copy of every visited page in order to serve searches of unstructured content.
Experience Huge Savings In Programming Time And Expense:
Proprietary SasaTAC Seekers (using regular expressions in PHP) selectively collect information from the same websites and from the deep web, bypassing unimportant content. Bandwidth and database storage savings of up to 95% are typical with the SasaTAC Seeker compared with open source crawlers. The SasaTAC Seeker will get your important content faster, with similar savings in programming time.
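As a rough illustration of selective, regular-expression-based collection, the sketch below pulls only two fields out of a page and ignores everything else. It is written in Python for brevity rather than PHP, and the sample markup, the `LISTING` pattern and the `extract_listings` helper are all hypothetical; it does not show how a SasaTAC Seeker is actually implemented.

```python
import re

# Hypothetical page fragment; the markup and field names are invented for illustration.
PAGE = """
<div class="nav">Home | About | Contact</div>
<div class="listing"><span class="title">Blue Widget</span><span class="price">$19.99</span></div>
<div class="ad">Buy now!</div>
<div class="listing"><span class="title">Red Widget</span><span class="price">$24.50</span></div>
"""

# One pattern targets only the fields of interest; navigation and ads never match.
LISTING = re.compile(
    r'<span class="title">(?P<title>[^<]+)</span>'
    r'<span class="price">\$(?P<price>\d+\.\d{2})</span>'
)

def extract_listings(html):
    """Return only the matched fields as dicts, skipping all other markup."""
    return [
        {"title": m.group("title"), "price": float(m.group("price"))}
        for m in LISTING.finditer(html)
    ]

records = extract_listings(PAGE)
```

Because only the matched fields are kept, the stored record is a small fraction of the page, which is the kind of storage saving the paragraph above refers to.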
The following chart represents the power of the SasaTAC Seekers vs. Open Source crawlers:
| Function | Open Source Crawler | SasaTAC Seeker |
|---|---|---|
| Identification of generic information in different contexts | [image] | [image] |
| Identification of specific contextual data (not specific type) in different contexts (see reference below) | [image] | |
| Seeker processes data, assigns a unique ID and delivers information | [image] | |
| Any target text-based format supported and displayed (XML, HTML, JS, JSON, Ajax, CSV) | [image] | |
| Multi-connection protocols supported: HTTP, FTP, IMAP, POP | [image] | |
Get Complete Results With Multiple Request Automation:
SasaRecur is a proprietary thread technology: automated commands that operate independently of one another, yet collaborate, to trigger redundant instances of a single PHP script. Think of SasaRecur as a cloning device that saves processor time, memory and programming effort and provides more comprehensive and complete results for your unique search needs.
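The general idea of running many independent instances of one routine and merging their results can be sketched as follows. This is a Python illustration using a standard thread pool, not SasaRecur itself; the `seek` stand-in and the `run_instances` helper are hypothetical, and a real instance would fetch and parse a page instead of fabricating items.

```python
from concurrent.futures import ThreadPoolExecutor

def seek(page_number):
    """Stand-in for one script instance; a real one would fetch and parse a page."""
    return {
        "page": page_number,
        "items": [f"item-{page_number}-{i}" for i in range(3)],
    }

def run_instances(page_numbers, max_workers=4):
    """Run the same routine once per input and merge the results in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the order of the inputs, not completion order.
        results = list(pool.map(seek, page_numbers))
    return [item for result in results for item in result["items"]]

items = run_instances(range(1, 6))
```

Each call to `seek` runs without knowledge of the others, and the only point of collaboration is the merged result list.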
Collect Content Anonymously And Efficiently:
SasaStealth is an important technology that enables SasaTAC Seekers to anonymously collect desired content from our targeted sources. Daily, SasaStealth identifies and independently rates hundreds of proxies for efficiency and instructs SasaTAC Seekers to use the most trusted proxies to facilitate your content collection.
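One simple way to rate proxies for efficiency is to combine their observed success rate with their average latency and keep the highest scorers. The sketch below assumes exactly that; the scoring formula, the `ProxyStats` fields and the sample addresses are hypothetical and are not taken from SasaStealth.

```python
from dataclasses import dataclass

@dataclass
class ProxyStats:
    address: str          # hypothetical proxy address
    successes: int        # requests that completed
    failures: int         # requests that failed or timed out
    avg_latency_s: float  # mean response time in seconds

def score(proxy):
    """Higher is better: success rate, discounted by average latency."""
    attempts = proxy.successes + proxy.failures
    if attempts == 0:
        return 0.0  # an untested proxy earns no trust yet
    success_rate = proxy.successes / attempts
    return success_rate / (1.0 + proxy.avg_latency_s)

def most_trusted(proxies, count=2):
    """Return the `count` best-scoring proxies, best first."""
    return sorted(proxies, key=score, reverse=True)[:count]

observed = [
    ProxyStats("10.0.0.1:8080", 95, 5, 0.4),
    ProxyStats("10.0.0.2:8080", 60, 40, 0.2),
    ProxyStats("10.0.0.3:8080", 99, 1, 2.5),
    ProxyStats("10.0.0.4:8080", 0, 0, 0.0),
]
best = [p.address for p in most_trusted(observed)]
```

With these sample figures the reliable but slow third proxy ranks below the first two, showing how latency and reliability trade off under this particular formula.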
Monitor Site Sources in Real Time:
SasaView is a separate, supporting application that provides graphical real-time monitoring of SasaTAC Seekers. Since websites change frequently, SasaView alerts our technicians when SasaTAC Seekers encounter data collection problems, and corrective measures are taken immediately.
Collect Fresh Content; Receive Alerts When It Changes:
SasaFresh is a proprietary technology that triggers mini Seekers to measure the status of the delivered content (current, changed or removed) in real time. Teamed with our WatchDog Alert feature, SasaFresh alerts you to content changes, ensuring that you receive timely and accurate content.
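The three statuses named above (current, changed, removed) can be derived by comparing a stored fingerprint of the delivered content with a fresh fetch. The following is a minimal Python sketch under that assumption; the `fingerprint` and `content_status` helpers are hypothetical, and the source does not say how SasaFresh actually detects changes.

```python
import hashlib

def fingerprint(content):
    """Stable digest of the delivered content, stored alongside the record."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()

def content_status(stored_fingerprint, fetched_content):
    """Classify a re-check; fetched_content is None when the source no longer serves the item."""
    if fetched_content is None:
        return "removed"
    if fingerprint(fetched_content) == stored_fingerprint:
        return "current"
    return "changed"

stored = fingerprint("Price: $19.99")
statuses = [
    content_status(stored, "Price: $19.99"),
    content_status(stored, "Price: $17.99"),
    content_status(stored, None),
]
```

A status other than "current" is the natural trigger for an alert of the kind described for the WatchDog Alert feature.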
Efficient Data Mining, Retrieval and Backup:
The Sasa Database has been specially designed to hold the content delivered by SasaTAC Seekers and to index it for rapid retrieval. This content can be delivered to your hosting environment or can reside on our servers, depending upon your unique needs.
Statistical Metrics At Your Fingertips:
SasaIndex is an analytics tool that can parse content in order to produce valuable statistical analyses, reports and trends. The information is processed using the results of search criteria. These statistical analyses are constantly changing as new content reaches the database.
Technologies employed: LAMP platform (Linux, Apache, MySQL and PHP).

