worldbrain

worldbrain/remove-stopwords

A simple repository to remove 'irrelevant for search' words, support for 51 languages

JavaScript
26
3
MIT License

This Node.js library removes common "stopwords" from text to improve search indexing efficiency, supporting 51 languages with customizable stopword lists. It's designed for developers building search or NLP systems who need to filter out frequent, low-meaning words across diverse languages, particularly for large-scale document processing like WorldBrain's use case.

Total donated
Undistributed
Share with your subscribers: