Features
Modular design (OSGi)
Exchangeable search-providers
Exchangeable crawlers
Exchangeable parsers
Exchangeable
URL
/content-filters
Flexible build system:
Apache Maven
Support for all prevalent protocols (
HTTP
,
FTP
, SMB/CIFS, …)
Support for prevalent data formats (
HTML
, Text,
PDF
, DOC, …)
Support for prevalent APIs
OpenSearch
DBus
Json
/REST
API
Pre-installed content filters
RegExp based blacklist filter
robots.txt
support
HTTP
based web interface for administration purposes
OpenID
support
SSL
support
Dynamically extensible by
javax.servlet
servlets
Apache Velocity
servlets
Dynamically extensible Desktop integration
Overview flow of information
SVG version