Normalize strings before searching
After research on a reported snark issue that was actually caused by an external tool, we've learned a little about unicode normalization and java. For non-ascii searches to work reliably, we need to normalize the search string and the searched strings, probably to NFD (decomposed, which is faster than NFC that does decompose-then-compose).
ref: java.text.Normalizer.normalize()