Post Reply 
Problem with tag option "ignore"
09-05-2016, 23:06
Post: #1
Problem with tag option "ignore"
When I use [all].ignore.sort={The,A,Les,Le,L',La,Der,Die,Das,Ein,Eine}, "La primavera" is sorted correctly, but "L'estate", "L'autunno", and "L'inverno" are not.

This happens because the articles in the option parameter are matched only if followed by a space. Can this be changed to match at a word boundary (using a RE with \b)?
Find all posts by this user
Quote this message in a reply
10-05-2016, 22:49
Post: #2
RE: Problem with tag option "ignore"
From a quick web search, it appears that different regex processors have different interpretations of word characters and non-word characters and it could be quite tricky to implement this in a fully general way that works with all Unicode word characters and non-word characters. Java has some support for \b that apparently understands Unicode word characters and I will do some investigation to find out whether this would be a suitable solution.
Find all posts by this user
Quote this message in a reply
Post Reply 


Forum Jump:


User(s) browsing this thread: 1 Guest(s)