Norconex Web dan crawler sistem file adalah crawler fitur lengkap (atau laba-laba) yang dapat memanipulasi dan menyimpan data yang dikumpulkan dalam gudang pilihan Anda (misalnya, mesin pencari). They are very flexible, powerful, easy to extend, and portable. They can be used command-line with file-based configuration on any OS or embedded into Java applications using well-documented APIs.
Visit the website for binary downloads and documentation: https://opensource.norconex.com/crawlers/
This branch holds version 4 code, which is still in development.
For the latest stable release of Norconex Web Crawler, use the version 3 branch.
As of Feb 24, 2024, the default main branch holds code for the upcoming version 4 crawler stack. It is now a mono-repo containing all Norconex crawler-related projects previously maintained in separate repos. Semua proyek dalam laporan mono ini sekarang akan dirilis secara bersamaan dan berbagi nomor versi yang sama.
Sampai V4 secara resmi dilepaskan, cabang ini tidak boleh dianggap stabil.
| Map | ID Artefak | Membangun |
|---|---|---|
| crawler/core/ | Tes NX-Crawler-Core | |
| crawler/fs/ | nx-crawler-fs | |
| crawler/web/ | nx-crawler-web | |
| pengimpor/ | NX-Importer | |
| Committer/Amazoncloudsearch/ | NX-Committer-Amazoncloudsearch | |
| Committer/Apachekafka/ | NX-Committer-Apachekafka | |
| Committer/AzureCognitifearch/ | NX-Committer-AzureCognitivesearch | |
| committer/core/ | NX-Committer-Core | |
| Committer/Idol/ | NX-Committer-Idol | |
| Committer/Elasticsearch/ | NX-Committer-Elasticsearch | |
| committer/neo4j/ | nx-committer-neo4j | |
| committer/solr/ | NX-Committer-Solr | |
| committer/sql/ | NX-Committer-SQL |
Semua proyek dalam repositori ini berbagi ID grup Maven yang sama:
com.norconex.crawler