News Search Engine

How to use
- Install python 3.4+ environment (recommended Anaconda or Miniconda)
- Install the lxml html parser, the command is
pip install lxml - Install the jieba word participle component, the command is
pip install jieba - Install Flask Web Framework, the command is
pip install Flask - Enter the web folder and run the main.py file
- Open the browser and visit http://127.0.0.1:5000/ Enter keywords to start testing
If you want to grab the latest news data and build an index, run ./code/setup.py with one click, and then test it according to the above method.
2020.4.5: Added a crawler program to capture China News Network. First run ./code/spider.chinanews.com.py to crawl the news in the last 5 days (about 2500); then comment ./code/setup.py line 38 and run it to automatically build the index.
Project Introduction
- Building a search engine with me (I) Introduction
- Build a search engine with me (II) Web crawler
- Build a search engine with me (III) Build an index
- Work with me to build a search engine (IV) search model
- Build a search engine with me (Five) Recommended reading
- Build a search engine with me (VI) system display
- Building a search engine with me (7) Summary of the outlook
- Build a search engine with me (eight) Update crawler & modify scoring & online deployment
grateful