How to search in text files for words ignoring special characters. How to ignore accents and German umlauts,
or even different punctuation like slashes,
dots and quotes, with a Freeware tool for Windows and Linux/Mac.
|
After download, run the tool by double click, then click on Open
and select a directory from which you want to load all text files.
All ASCII text files from that folder will be loaded, like all
.txt, .ini, .html, readme or source code files.
accent insensitive search
When searching through text in European languages, containing
special characters like � (a accent) or � (a umlaut), it is
sometimes helpful to treat those characters like a simple "a".
This can be done by activating accent insensitive search:
1. Select "mode / ignore accents".
2. now for example, type a German word like "blatt",
and it should find both the words "blatt" and "bl�tter"
(finding both the singular and plural forms of the word).
Accent insensitive search is supported only on windows systems using
- Codepage 1252 (Western Europe)
- Codepage 1250 (Central Europe)
to find out what codepage your system is using, click on "Mode"
then hoover the mouse over "ignore accents". If DView says
"not supported" then accent insensitive search may work
only with a few characters, or none at all.
punctuation insensitive search
Select "mode / ignore punctuation" for puncuation insensitive search:
- slashes \ and / are treated as the same,
helpful for text with mixed Windows/Unix filename formats.
- all quotations characters are treated as the same:
" ' ` � and some more,
in detail the ISO 8859-1 character codes 0x22, 0x27, 0x60,
0x82, 0x84, 0x8B, 0x91, 0x92, 0x93, 0x94, 0x9B, 0xB4, 0xBB.
- , and . are treated as the same,
helpful for text with mixed number formats.
Punctuation insensitive search is supported only on windows systems
using the code pages:
- Codepage 1252 (Western Europe)
- Codepage 1250 (Central Europe)
to find out what codepage your system is using, click on "Mode"
then hoover the mouse over "ignore punctuation". If DView says
"not supported" then punctuation insensitive search may work
only with a few characters, or none at all.
|