The removal of noise words ("a, an, then, of...") from indexing is a good move. But if you then include these words in an exact phrase search, it will return no results as the exact string is not in the index.
So a book title "Tess of the d'Urbervilles" will be indexed as "Tess d'Urbervilles", but someone typing in the full title will never find it.
I can, of course, remove the stripping of the noise words on indexing so I get the full title in the index. But stripping noise words IS a good idea. Does it not, then, make logical sense to strip the noise words from any search phrase submitted so that it matches the index?
Thanks.
So a book title "Tess of the d'Urbervilles" will be indexed as "Tess d'Urbervilles", but someone typing in the full title will never find it.
I can, of course, remove the stripping of the noise words on indexing so I get the full title in the index. But stripping noise words IS a good idea. Does it not, then, make logical sense to strip the noise words from any search phrase submitted so that it matches the index?
Thanks.
Comment