Here you can find the source of removeDiacritics(String st)
Parameter | Description |
---|---|
st | a parameter |
public static String removeDiacritics(String st)
//package com.java2s; /**/*from ww w.j a v a 2s .c o m*/ * PTStemmer - A Stemming toolkit for the Portuguese language (C) 2008-2010 Pedro Oliveira * * This file is part of PTStemmer. * PTStemmer is free software: you can redistribute it and/or modify * it under the terms of the GNU Lesser General Public License as published by * the Free Software Foundation, either version 3 of the License, or * (at your option) any later version. * * PTStemmer is distributed in the hope that it will be useful, * but WITHOUT ANY WARRANTY; without even the implied warranty of * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the * GNU Lesser General Public License for more details. * * You should have received a copy of the GNU Lesser General Public License * along with PTStemmer. If not, see <http://www.gnu.org/licenses/>. * */ import java.text.Normalizer; public class Main { /** * Remove diacritics (i.e., accents) from String * @param st * @return */ public static String removeDiacritics(String st) { st = Normalizer.normalize(st, Normalizer.Form.NFD); return st.replaceAll("[^\\p{ASCII}]", ""); } }