Use technology golang+goleveldb
Built-in Xiaobai database system (text-level full-text index database system), built-in "full-text index", no need for dictionary segmentation, but there are never words that cannot be searched.
Research has been gradually improved and improved from the original "Qianlong Tripitaka Search Engine" and "Siku Quanshu Search Engine".
It can be used to organize a large amount of information and has a search function at the text-level level.
It can also be used only as a search intermediary, such as: site search; corporate search engines, etc.
The independently developed traversal word segmentation technology, a breakthrough technology for search engines, does not require vocabulary segmentation, and the search success rate is 100%.
Thesaurus is the core of search engines. The word segmentation is the eyes of search engines. Without eyes, you can't see anything and search for nothing.
However, the vocabulary database is basically difficult to perfect, and it is impossible to achieve completeness. Therefore, there must be some words that cannot be searched for.
Especially for new words, it is impossible to search for new words at the first time because there is no in the vocabulary library.
If new words cannot be searched, it is equivalent to killing the driving force for innovation, especially in the e-commerce field.
Even if the word segmentation management adds new words to the library as soon as possible, to search, it is necessary to go through traversal of all the original data to get the result.
The larger the data volume of the system, the more cautious it is to add new words and the slower the time.
Research, abandons the dictionary.
If the word segmentation library is the eyes of other search engines, this eye is the naked eye. The eyes that traverse participle are the heavenly eyes.
Other word segmentation techniques will lead to the probability of not being able to search results due to the incompleteness of the word segmentation database. In other words, the search success rate is 100%.
To give an extreme example:
Turn all one article upside down and search with the reversed words as well.
Other word participle techniques are probably not able to search for anything.
The search success rate is 100% in traversal word segmentation technology.
You can customize the search granularity.
Common search engines such as Google and Baidu, the search granularity is the entire article.
Research, you can customize to paragraphs, sentences, etc.
Usually, precise granularity is defined as a sentence.
Tens of billions of data, millisecond response.
Because there is no need for word segmentation and no need to parse word segmentation, it has higher performance than search engines with other word segmentation technologies.
Add real-time searches in real time.
10G-level text data only requires dozens of M of memory.
Open the executable file of the corresponding system and then run it.