I am rather disappointed the way zoom is dealing with
unicode while indexing pages containing Bengali - an
Asian language, making use of zoom search NOT possible.
Words like this
are being broken up at & # 2509 ; (read without spaces)
so that what is indexed is
In "Indexing word rules" there is no option
to add own rule or to correct this
Can there be any quickfix / work around ?
PS : I find that this character
্ seen on screen as
is being allotted -1 . I think this may be causing the problem. How t oprevent this ?? Incidentally this is same as & # 2509 ; (read without spaces)
For example you may create a text or html file containg following
and now enter search term
zoomsearch will NOT find it
However it will find
unicode while indexing pages containing Bengali - an
Asian language, making use of zoom search NOT possible.
Words like this
Code:
গ্র
so that what is indexed is
Code:
গ ্ র
to add own rule or to correct this
Can there be any quickfix / work around ?
PS : I find that this character
্ seen on screen as
Code:
্
For example you may create a text or html file containg following
Code:
গ্র
Code:
গ্র
However it will find
Code:
গ
Comment