Package com.ibm.icu.charset
Class CharsetCompoundText
java.lang.Object
java.nio.charset.Charset
com.ibm.icu.charset.CharsetICU
com.ibm.icu.charset.CharsetCompoundText
- All Implemented Interfaces:
Comparable<Charset>
-
Nested Class Summary
Nested Classes
Modifier and Type | Class | Description
(package private) class
(package private) class
-
Field Summary
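Fields
Modifier and Type | Field | Description
private static final byte | COMPOUND_TEXT_SINGLE_0
private static final byte | COMPOUND_TEXT_SINGLE_1
private static final byte | COMPOUND_TEXT_SINGLE_2
private static final byte | COMPOUND_TEXT_SINGLE_3
private static final byte | DO_SEARCH
private static final byte | ESC_START
private static final byte[][] | escSeqCompoundText
private static final byte[] | fromUSubstitution
private static final byte | IBM_874
private static final byte | IBM_912
private static final byte | IBM_913
private static final byte | IBM_914
private static final byte | IBM_915
private static final byte | IBM_916
private static final byte | IBM_923
private static final byte | INVALID
private static final byte | ISO_8859_14
private CharsetMBCS[] | myConverterArray
private static final byte | NUM_OF_CONVERTERS
private static final byte | SEARCH_LENGTH
private byte | state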
Fields inherited from class com.ibm.icu.charset.CharsetICU
codepage, conversionType, hasFromUnicodeFallback, hasToUnicodeFallback, icuCanonicalName, maxBytesPerChar, maxCharsPerByte, minBytesPerChar, name, options, platform, ROUNDTRIP_AND_FALLBACK_SET, ROUNDTRIP_SET, subChar, subChar1, subCharLen, unicodeMask
-
Constructor Summary
Constructors
Constructor | Description
CharsetCompoundText(String icuCanonicalName, String javaCanonicalName, String[] aliases)
-
Method Summary
Modifier and Type | Method | Description
private static int | findNextEsc(ByteBuffer source)
private static byte | findStateFromEscSeq(ByteBuffer source, byte[] toUBytes, int toUBytesLength)
private static byte | getState(int codepoint)
(package private) void | getUnicodeSetImpl(UnicodeSet setFillIn, int which) | This follows the ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream, for example U+FEFF (the Unicode BOM/signature character), which can be ignored.
private static boolean | isASCIIRange(int codepoint)
private static boolean | isCompoundS1(int codepoint)
private static boolean | isCompoundS2(int codepoint)
private static boolean | isCompoundS3(int codepoint)
private static boolean | isIBM874(int codepoint)
private static boolean | isIBM912(int codepoint)
private static boolean | isIBM913(int codepoint)
private static boolean | isIBM914(int codepoint)
private static boolean | isIBM915(int codepoint)
private static boolean | isIBM916(int codepoint)
private static boolean | isIBM923(int codepoint)
private static boolean | isISO8859_14(int codepoint)
private void | LoadConverters()
Methods inherited from class com.ibm.icu.charset.CharsetICU
contains, forNameICU, getCharset, getCompleteUnicodeSet, getNonSurrogateUnicodeSet, getUnicodeSet, isFixedWidth, isSurrogate
Methods inherited from class java.nio.charset.Charset
aliases, availableCharsets, canEncode, compareTo, decode, defaultCharset, displayName, displayName, encode, encode, equals, forName, forName, hashCode, isRegistered, isSupported, name, toString
-
Field Details
-
fromUSubstitution
private static final byte[] fromUSubstitution -
myConverterArray
private CharsetMBCS[] myConverterArray
-
state
private byte state -
INVALID
private static final byte INVALID
-
DO_SEARCH
private static final byte DO_SEARCH
-
COMPOUND_TEXT_SINGLE_0
private static final byte COMPOUND_TEXT_SINGLE_0
-
COMPOUND_TEXT_SINGLE_1
private static final byte COMPOUND_TEXT_SINGLE_1
-
COMPOUND_TEXT_SINGLE_2
private static final byte COMPOUND_TEXT_SINGLE_2
-
COMPOUND_TEXT_SINGLE_3
private static final byte COMPOUND_TEXT_SINGLE_3
-
IBM_915
private static final byte IBM_915
-
IBM_916
private static final byte IBM_916
-
IBM_914
private static final byte IBM_914
-
IBM_874
private static final byte IBM_874
-
IBM_912
private static final byte IBM_912
-
IBM_913
private static final byte IBM_913
-
ISO_8859_14
private static final byte ISO_8859_14
-
IBM_923
private static final byte IBM_923
-
NUM_OF_CONVERTERS
private static final byte NUM_OF_CONVERTERS
-
SEARCH_LENGTH
private static final byte SEARCH_LENGTH
-
escSeqCompoundText
private static final byte[][] escSeqCompoundText
-
ESC_START
private static final byte ESC_START
-
-
Constructor Details
-
CharsetCompoundText
-
-
Method Details
-
isASCIIRange
private static boolean isASCIIRange(int codepoint) -
isIBM915
private static boolean isIBM915(int codepoint) -
isIBM916
private static boolean isIBM916(int codepoint) -
isCompoundS3
private static boolean isCompoundS3(int codepoint) -
isCompoundS2
private static boolean isCompoundS2(int codepoint) -
isIBM914
private static boolean isIBM914(int codepoint) -
isIBM874
private static boolean isIBM874(int codepoint) -
isIBM912
private static boolean isIBM912(int codepoint) -
isIBM913
private static boolean isIBM913(int codepoint) -
isCompoundS1
private static boolean isCompoundS1(int codepoint) -
isISO8859_14
private static boolean isISO8859_14(int codepoint) -
isIBM923
private static boolean isIBM923(int codepoint) -
findNextEsc
private static int findNextEsc(ByteBuffer source)
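This page gives no description for findNextEsc, so the following is only a minimal sketch of what a scan for the next escape byte in a ByteBuffer could look like. The class name EscScanSketch, the ESC value 0x1B (the standard ASCII escape character), and the choice to start at the current position and return limit() when nothing is found are all assumptions, not the documented behavior of this class.

```java
import java.nio.ByteBuffer;

class EscScanSketch {
    // Assumption: escape sequences start with the ASCII ESC byte (0x1B).
    static final byte ESC = 0x1B;

    // Returns the absolute index of the first ESC byte at or after the
    // buffer's current position, or source.limit() if there is none.
    // Uses absolute get(), so the buffer's position is left unchanged.
    static int findNextEsc(ByteBuffer source) {
        for (int i = source.position(); i < source.limit(); i++) {
            if (source.get(i) == ESC) {
                return i;
            }
        }
        return source.limit();
    }

    public static void main(String[] args) {
        ByteBuffer buf = ByteBuffer.wrap(new byte[] {0x41, 0x42, 0x1B, 0x2D});
        System.out.println(findNextEsc(buf)); // prints 2
    }
}
```

Returning the limit for "not found" lets a caller treat everything up to the returned index as plain data without a separate sentinel check.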
-
getState
private static byte getState(int codepoint) -
findStateFromEscSeq
private static byte findStateFromEscSeq(ByteBuffer source, byte[] toUBytes, int toUBytesLength)
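The name findStateFromEscSeq, together with the escSeqCompoundText table and the INVALID and DO_SEARCH constants, suggests a lookup that maps an escape sequence to a converter state. The sketch below illustrates that general idea under stated assumptions: the class name, the constant values, the two-entry table, and the meaning given to DO_SEARCH ("input ended in the middle of a possible sequence") are all hypothetical, and the carried-over toUBytes argument is omitted for brevity. The two table rows are the ISO 2022 designations for the right halves of ISO 8859-1 and ISO 8859-2; they are not claimed to be the contents of escSeqCompoundText.

```java
import java.nio.ByteBuffer;

class EscSeqMatchSketch {
    // Hypothetical values; the real constants' values are not shown on this page.
    static final byte INVALID = -2;
    static final byte DO_SEARCH = -1;

    // Illustrative table only: ESC - A (ISO 8859-1 right half) and
    // ESC - B (ISO 8859-2 right half). The row index doubles as the state.
    static final byte[][] ESC_SEQS = {
        {0x1B, 0x2D, 0x41},
        {0x1B, 0x2D, 0x42},
    };

    // Compares the bytes at the buffer's position against each table row.
    // Returns the row index on a full match, DO_SEARCH if the input ran out
    // while still matching a row, and INVALID if no row can match.
    static byte findState(ByteBuffer source) {
        int start = source.position();
        int available = source.remaining();
        boolean partial = false;
        for (byte state = 0; state < ESC_SEQS.length; state++) {
            byte[] seq = ESC_SEQS[state];
            int n = Math.min(seq.length, available);
            boolean match = true;
            for (int i = 0; i < n; i++) {
                if (source.get(start + i) != seq[i]) {
                    match = false;
                    break;
                }
            }
            if (!match) {
                continue;
            }
            if (n == seq.length) {
                return state; // whole sequence present
            }
            partial = true;   // prefix matched, but more bytes are needed
        }
        return partial ? DO_SEARCH : INVALID;
    }
}
```

Distinguishing "needs more input" from "no match" matters for a streaming decoder, because a sequence may be split across two calls.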
-
LoadConverters
private void LoadConverters() -
newDecoder
- Specified by:
newDecoder in class Charset
-
newEncoder
- Specified by:
newEncoder in class Charset
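Because newDecoder and newEncoder are the standard java.nio.charset.Charset factory methods, client code can drive this charset through the ordinary Charset API. The following is a minimal sketch of an encode/decode round trip written against Charset in general. The class and method names are hypothetical, and ISO-8859-1 is used purely as a stand-in so the example runs without ICU4J on the classpath; how a CharsetCompoundText instance is obtained (for example through the inherited forNameICU) is not covered here.

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

class CodecRoundTripSketch {
    // Encodes text with cs.newEncoder(), decodes the bytes with cs.newDecoder(),
    // and returns the decoded string. Malformed or unmappable input is reported
    // as an exception instead of being silently replaced.
    static String roundTrip(Charset cs, String text) {
        try {
            ByteBuffer bytes = cs.newEncoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .encode(CharBuffer.wrap(text));
            CharBuffer chars = cs.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(bytes);
            return chars.toString();
        } catch (CharacterCodingException e) {
            throw new IllegalArgumentException("text cannot be round-tripped in " + cs, e);
        }
    }

    public static void main(String[] args) {
        // Stand-in charset; pass a different Charset instance to test another converter.
        System.out.println(roundTrip(StandardCharsets.ISO_8859_1, "caf\u00e9"));
    }
}
```

Using CodingErrorAction.REPORT makes unmappable characters visible as failures, which is usually what a round-trip check wants.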
-
getUnicodeSetImpl
void getUnicodeSetImpl(UnicodeSet setFillIn, int which)
Description copied from class: CharsetICU
This follows the ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream, for example U+FEFF (the Unicode BOM/signature character), which can be ignored. Detects Unicode signature byte sequences at the start of the byte stream and returns the number of bytes of the BOM of the indicated Unicode charset. Returns 0 when no Unicode signature is recognized.
- Specified by:
getUnicodeSetImpl in class CharsetICU
-