Efficient indexing and searching (1M Weibo 500M data is indexed in 28 seconds, search response time is 1.65 milliseconds, and search QPS is 19K). Support Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Support to calculate the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine will send the text to be segmented to a word segmentation coroutine through a channel, and the coroutine will segment the text and send it to a word segmentation through another channel. Indexer coroutines. The indexer coroutine builds an inverted index from the search keyword to the document, and the inverted index table is stored in memory for quick recall. The main coroutine receives the user's request, segments the request phrase in the main coroutine, and then sends it to the indexer through the channel.

Features

  • Support calculation of BM25 correlation
  • Support for custom scoring fields and scoring rules
  • Support online addition and deletion of indexes
  • Support persistent storage
  • Enables distributed indexing and searching
  • Released under the commercial-friendly Apache License

Project Samples

Project Activity

See All Activity >

Categories

Text Editors, Search

License

Apache License V2.0

Follow Wukong

Wukong Web Site

You Might Also Like
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Wukong!

Additional Project Details

Programming Language

Go

Related Categories

Go Text Editors, Go Search Software

Registered

2022-02-10