Category: Search Techniques

Using an Elasticsearch Analyzer to remove Stop Words from a text

Elasticsearch is widely used as a search layer or an analytics engine. Also interesting is the set of features Elasticsearch offers for Natural Language Processing. While working on...

/ November 9, 2017
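
One way to try this out, sketched here under assumptions, is to send a text to Elasticsearch's `_analyze` API with the built-in `stop` analyzer, which removes English stop words by default. The snippet only builds the request body; the helper name `build_analyze_request` and the sample text are made up for illustration, and the post itself may well use a different (for example custom) analyzer.

```python
import json


def build_analyze_request(text, analyzer="stop"):
    """Build the JSON body for a POST to the `_analyze` endpoint."""
    # "stop" is Elasticsearch's built-in analyzer that lowercases the text,
    # splits it on non-letters and drops English stop words by default.
    return json.dumps({"analyzer": analyzer, "text": text})


# Hypothetical sample text, used only for illustration.
body = build_analyze_request("The quick brown fox jumps over the lazy dog")
print(body)
```

The body would be sent as JSON with `POST /_analyze`; the response lists the tokens that survive the analyzer.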

How to remove stop words from a document or a bundle of documents

Although there are different ways of removing stop words from a document (or a bundle of documents), an easy one is to use NLTK (the Natural Language Toolkit) in Python. You can use the stopwords lists from NLTK...

/ February 8, 2017
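
As a rough illustration of the idea in the teaser above, the sketch below filters stop words from a short text in plain Python. The tiny `STOP_WORDS` set and the `remove_stop_words` helper are made up for this example; with NLTK installed and its stopwords corpus downloaded, `nltk.corpus.stopwords.words('english')` could supply a fuller list instead.

```python
import re

# Tiny illustrative stop word set (an assumption for this sketch, not NLTK's list).
STOP_WORDS = {"a", "an", "the", "is", "of", "to", "and", "in", "from"}


def remove_stop_words(text, stop_words=STOP_WORDS):
    """Return the tokens of `text` that are not in `stop_words`."""
    # Lowercase the text and split it into simple word tokens.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return [token for token in tokens if token not in stop_words]


print(remove_stop_words("The removal of stop words from a document is easy"))
```

For a bundle of documents, the same helper could simply be applied to each document in turn.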

Information Management and Information Retrieval Modules

I was given a chance to co-author (with Prof. Dr. D. Doherr) an article for the scientific journal “Praxis der Informationsverarbeitung und Kommunikation”. The article describes some innovations in the Humboldt Digital Library Project in the field of Information Retrieval...

/ November 11, 2009

Stop Words

List of English Stop Words

Stop words are words that do not carry enough significance to be useful in search queries. These words are usually filtered out of search queries because they return vast amounts of unnecessary information. A better definition is provided below: “Words...

/ April 14, 2009