Java 구현은 문자열에 중국어 문자가 포함되어 있는지 여부를 감지합니다.

저자：Eve Cole 업데이트 시간：2025-01-31 09:36:01

코드는 매우 실용적입니다. 여기서는 말도 안되는 코드가 아닙니다. 그냥 제시하세요.

주요 기능은 문자열에 중국어 문자가 포함되어 있는지 확인하고 이를 ASCLL로 바꾸는 것입니다.

 private static String regEx = "[//u4e00-//u9fa5]"; /** * 문자열에 중국어 문자가 포함되어 있는지 확인하고 이를 ASCLL로 바꿉니다. * * @param str_para * @return str_result */ private static String isChina_Replace( String str_para ) { 패턴 p = Pattern.compile( regEx ); String str_result = str_para; str_1 = ""; String str_data[] = null; String str_return_reslut = ""; if ( str_result != null && str_result.trim().length() > 0 ) { str_data = str_result.split( "" ); for ( int i = 0; i < str_data.length; i++ ) { 일치자 m = p.matcher( str_data[i] ); /* Ld(str_data[i]); */ int count = 0; if ( m.find() ) { str_result = m.group( 0 ); getBytes( "GBK" ); str_0 = Integer.toHexString( b[0] ) str_1 = Integer.toHexString( b[1] ); str_return_reslut = str_return_reslut + "/" + conver10( str_0 ) + conver10( str_1 ) + "/"; } else { str_return_reslut = str_return_reslut + str_data[i] } } } catch ( NumberFormatException e ) { e.printStackTrace() ; } 잡기(UnsupportedEncodingExceptione ) { e.printStackTrace(); } } else { return(str_return_reslut); } return(str_return_reslut); } /* 10진수로 문자열 변환*/ public static int conver10( String str_0 ) { return(Integer.parseInt( str_0.substring) ( str_0.length() - 2, str_0.length() ), 16 ) );

수요가 높지 않은 곳에서 사용할 수 있는 조금 더 간단한 코드를 살펴보겠습니다.

 import java.util.regex.Matcher;import java.util.regex.Pattern;public 클래스 데모 {static String regEx = "[/u4e00-/u9fa5]";static Pattern pat = Pattern.compile(regEx);public static void main(String[] args) {String input = "지옥 세상!";System.out.println(isContainsChina(input));input = "hello world";System.out.println(isContainsChina(input));} public static boolean isContainsChina(String str){Matcher matcher = pat.matcher(str);boolean flg = false;if(matcher.find()) {flg = true;}flg를 반환;}

마지막으로 다양한 문자에 대한 유니코드 인코딩 범위를 첨부합니다.
* 한자: [0x4e00,0x9fa5] (또는 십진수 [19968,40869])
* 숫자: [0x30,0x39] (또는 십진수 [48, 57])
*소문자: [0x61,0x7a] (또는 십진수 [97, 122])
* 대문자: [0x41,0x5a] (또는 십진수 [65, 90])