How to deal with diacritics in parsed text

Discussion in 'A-Parser Support Forum' started by r3dn4x, Feb 4, 2016.

  1. r3dn4x

    r3dn4x A-Parser Enterprise License
    A-Parser Enterprise

    Joined:
    Jan 22, 2014
    Messages:
    14
    Likes Received:
    2
  2. r3dn4x

    r3dn4x A-Parser Enterprise License
    A-Parser Enterprise

    Joined:
    Jan 22, 2014
    Messages:
    14
    Likes Received:
    2
    I ended up just doing a bunch of regex replaces, in case anyone else runs into this:

    [ÀÁÂÃÄÅ] to A
    [àáâãäå] to a
    [ÈÉÊË] etc
    [é]
    [í]
    [ó]
    [úü]
    [ñ]
    [Ñ]
     
  3. Support

    Support Administrator
    Staff Member A-Parser Enterprise

    Joined:
    Mar 16, 2012
    Messages:
    4,503
    Likes Received:
    2,148
    This is the only way solve the problem.
     

Share This Page