Index

A B C D E F G H I L M N O P R S T U V W _ 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form

A

add(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Add n-gram to profile
add(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
addChar(char) - Method in class com.optimaize.langdetect.cybozu.util.NGram
 
addCharSequence(LangProfile, CharSequence) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
addGram(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
Shortcut for addGram(ngram, 1).
addGram(String, int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
If the builder already has this ngram, the given frequency is added to the current count.
addOpt(String, String, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
addText(CharSequence) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
In order to use this you must set the LanguageProfileBuilder.ngramExtractor first.
affixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Sets prefixFactor() and suffixFactor() both to the given value.
alpha - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
alpha - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
alpha(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ALPHA_DEFAULT - Static variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ALPHA_WIDTH - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
append(char) - Method in class com.optimaize.langdetect.text.TextObject
 
append(Reader) - Method in class com.optimaize.langdetect.text.TextObject
Append the target text for language detection.
append(CharSequence) - Method in class com.optimaize.langdetect.text.TextObject
Append the target text for language detection.
append(CharSequence, int, int) - Method in class com.optimaize.langdetect.text.TextObject
 
applyPadding(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
arglist - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
assignLang(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

B

backwards() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
The old way of doing n-grams.
BACKWARDS - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
 
BackwardsCompatibleNgramFilter - Class in com.optimaize.langdetect.ngram
Filters those that were not generated by the old n-gram generator.
BackwardsCompatibleNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
BASE_FREQ - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
batchTest() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Batch Test of Language Detection (--batchtest option)
buf_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
build() - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
build() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
build() - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
BuiltInLanguages - Class in com.optimaize.langdetect.profiles
 
BuiltInLanguages() - Constructor for class com.optimaize.langdetect.profiles.BuiltInLanguages
 
BUNDLE_NAME - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
 

C

capitalword_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
 
charAt(int) - Method in class com.optimaize.langdetect.text.TextObject
 
CharNormalizer - Class in com.optimaize.langdetect.cybozu.util
Some character normalization (and exclusion) functionality.
CharNormalizer() - Constructor for class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
CharNormalizerTextFilterImpl - Class in com.optimaize.langdetect.text
Deprecated.
can't be used because it would be a big loss to not inline this code.
CharNormalizerTextFilterImpl() - Constructor for class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
Deprecated.
 
CJK_CLASS - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
CJK Kanji Normalization Mapping
cjk_map - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
clear() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
closeQuietly(Closeable) - Static method in class com.optimaize.langdetect.frma.IOUtils
Deprecated.
use java7 closeable
closeTag(LangProfile) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
com.optimaize.langdetect - package com.optimaize.langdetect
 
com.optimaize.langdetect.cybozu - package com.optimaize.langdetect.cybozu
Original language detection classes from https://code.google.com/p/language-detection/
com.optimaize.langdetect.cybozu.util - package com.optimaize.langdetect.cybozu.util
Provides the utility classes for language detection.
com.optimaize.langdetect.frma - package com.optimaize.langdetect.frma
 
com.optimaize.langdetect.i18n - package com.optimaize.langdetect.i18n
 
com.optimaize.langdetect.ngram - package com.optimaize.langdetect.ngram
Provides functionality for handling n-grams.
com.optimaize.langdetect.profiles - package com.optimaize.langdetect.profiles
Provides functionality for loading, storing and creating LanguageProfiles.
com.optimaize.langdetect.profiles.util - package com.optimaize.langdetect.profiles.util
 
com.optimaize.langdetect.text - package com.optimaize.langdetect.text
Provides functionality for concatenating and cleaning text that is used as a) learning text to produce com.optimaize.langdetect.LanguageProfiles b) for the text for which the language is to be guessed.
CommandLineInterface - Class in com.optimaize.langdetect.cybozu
LangDetect Command Line Interface.
CommandLineInterface() - Constructor for class com.optimaize.langdetect.cybozu.CommandLineInterface
 
CommonTextObjectFactories - Class in com.optimaize.langdetect.text
Contains some standard TextObjectFactorys ready to use for common use cases.
CommonTextObjectFactories() - Constructor for class com.optimaize.langdetect.text.CommonTextObjectFactories
 
compareTo(DetectedLanguage) - Method in class com.optimaize.langdetect.DetectedLanguage
See class header.
CONV_THRESHOLD - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
convert(LangProfile) - Static method in class com.optimaize.langdetect.profiles.OldLangProfileConverter
 
count() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
count_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
countByScript(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
create() - Method in class com.optimaize.langdetect.text.TextObjectFactory
 
create(NgramExtractor) - Static method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
create(Collection<LanguageProfile>, Collection<Integer>) - Static method in class com.optimaize.langdetect.NgramFrequencyData
 

D

DEFAULT_ALPHA - Static variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
smoothing default parameter (ELE)
DEFAULT_SEED - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
This is used when no custom seed was passed in.
detect(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
Returns the best detected language if the algorithm is very confident.
detect(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
detectBlock(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
detectBlockLongText(List<String>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
This is the original algorithm used for all text length.
detectBlockShortText(Map<String, Integer>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
DetectedLanguage - Class in com.optimaize.langdetect
Holds information about a detected language: the locale (language) and the probability.
DetectedLanguage(LdLocale, double) - Constructor for class com.optimaize.langdetect.DetectedLanguage
 
detectLang() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Language detection test for each file (--detectlang option)

E

equals(Object) - Method in class com.optimaize.langdetect.i18n.LdLocale
 
equals(Object) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
extractCountedGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
extractGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
Creates the n-grams for a given text in the order they occur.
extractNGrams(CharSequence, OldNgramExtractor.Filter) - Static method in class com.optimaize.langdetect.ngram.OldNgramExtractor
Deprecated.

F

filter - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
filter(NgramFilter) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
Deprecated.
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.MultiTextFilter
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
filter(CharSequence) - Method in interface com.optimaize.langdetect.text.TextFilter
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.UrlTextFilter
 
filters - Variable in class com.optimaize.langdetect.text.MultiTextFilter
 
findMost(Map<Character.UnicodeScript, Long>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
forDetectingOnLargeText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forDetectingShortCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forIndexing() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forIndexingCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forText(CharSequence) - Method in class com.optimaize.langdetect.text.TextObjectFactory
 
forThreshold(double) - Static method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
If a script has less than this fraction of content compared to the most used one, its text is removed.
freq - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Key = ngram, value = count.
FREQ_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
fromString(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

G

generate(String, File) - Static method in class com.optimaize.langdetect.frma.GenProfile
Loads a text file and generate a language profile from its content.
generateProfile() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Generate Language Profile from a text file.
GenProfile - Class in com.optimaize.langdetect.cybozu
Load Wikipedia's abstract XML as corpus and generate its language profile in JSON format.
GenProfile - Class in com.optimaize.langdetect.frma
Generate a language profile from any given text file.
GenProfile() - Constructor for class com.optimaize.langdetect.cybozu.GenProfile
 
GenProfile() - Constructor for class com.optimaize.langdetect.frma.GenProfile
 
get(int) - Method in class com.optimaize.langdetect.cybozu.util.NGram
TODO this method has some weird, undocumented behavior to ignore ngrams with upper case.
getFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getFrequency(String) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
 
getFrequency(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getGramLengths() - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
getGramLengths() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells what the n in n-grams are used here.
getGramLengths() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getInstance() - Static method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
getInstance() - Static method in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
getInstance() - Static method in class com.optimaize.langdetect.text.UrlTextFilter
 
getLanguage() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getLanguage(int) - Method in class com.optimaize.langdetect.NgramFrequencyData
 
getLanguageList() - Method in class com.optimaize.langdetect.NgramFrequencyData
 
getLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
getLocale() - Method in class com.optimaize.langdetect.DetectedLanguage
 
getLocale() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
 
getLocale() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getMaxGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often the n-gram with the highest amount of occurrences used in this profile occurred.
getMaxGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getMinGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often the n-gram with the lowest amount of occurrences used in this profile occurred.
getMinGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getName() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getNumGramOccurrences(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often all n-grams of a certain length occurred, combined.
getNumGramOccurrences(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNumGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how many n-grams there are for all n-gram sizes combined.
getNumGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNumGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how many different n-grams there are for a certain n-gram size.
getNumGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNWords() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getParamDouble(String, double) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Returns the double, or the default is absent.
getParamLongOrNull(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
getProbabilities(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
Returns all languages with at least some likeliness.
getProbabilities(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
getProbabilities(String) - Method in class com.optimaize.langdetect.NgramFrequencyData
Don't modify this data structure! (Can't make array immutable...)
getProbability() - Method in class com.optimaize.langdetect.DetectedLanguage
 
getRegion() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getScript() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getShortTextLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
getString(String) - Static method in class com.optimaize.langdetect.cybozu.util.Messages
 
gramLength(int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
 
gramLengths - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
gramLengths(Integer...) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
 
grams_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
 
guessNumDistinctiveGrams(int, int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
This is trying to be smart.

H

hashCode() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
hashCode() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
hasParam(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

I

increment(Map<Character.UnicodeScript, Long>, Character.UnicodeScript) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
initProbability() - Method in class com.optimaize.langdetect.LanguageDetectorImpl
Initialize the map of language probabilities.
INSTANCE - Static variable in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
INSTANCE - Static variable in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
INSTANCE - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
internalReader - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
IOUtils - Class in com.optimaize.langdetect.frma
Deprecated.
IOUtils() - Constructor for class com.optimaize.langdetect.frma.IOUtils
Deprecated.
Private constructor to prevent instantiation.
isSpace() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
iterateGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Iterates all ngram strings with frequency.
iterateGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
iterateGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Iterates all gramLength-gram strings with frequency.
iterateGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
ITERATION_LIMIT - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.

L

langlist - Variable in class com.optimaize.langdetect.NgramFrequencyData
All the loaded languages, in exactly the same order as the data is in the double[] in wordLangProbMap.
LangProfile - Class in com.optimaize.langdetect.cybozu.util
Deprecated.
replaced by LanguageProfile
LangProfile() - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Constructor for JSONIC
LangProfile(String) - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Normal Constructor
LangProfileReader - Class in com.optimaize.langdetect.frma
Reads LangProfiles.
LangProfileReader() - Constructor for class com.optimaize.langdetect.frma.LangProfileReader
 
LangProfileWriter - Class in com.optimaize.langdetect.frma
Writes a LangProfile to an output stream (file).
LangProfileWriter() - Constructor for class com.optimaize.langdetect.frma.LangProfileWriter
 
langsAdded - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
language - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
LanguageDetector - Interface in com.optimaize.langdetect
Guesses the language of an input string or text.
LanguageDetectorBuilder - Class in com.optimaize.langdetect
Builder for LanguageDetector.
LanguageDetectorBuilder(NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorBuilder
 
LanguageDetectorImpl - Class in com.optimaize.langdetect
This class is immutable and thus thread-safe.
LanguageDetectorImpl(NgramFrequencyData, double, Optional<Long>, int, double, double, double, double, Map<LdLocale, Double>, NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorImpl
LanguageLister - Class in com.optimaize.langdetect.profiles.util
This is just a utility to update the code with the existing languages.
LanguageLister() - Constructor for class com.optimaize.langdetect.profiles.util.LanguageLister
 
languagePriorities(Map<LdLocale, Double>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
TODO document exactly.
LanguageProfile - Interface in com.optimaize.langdetect.profiles
A language profile knows the locale (language), and contains the n-grams and some statistics.
LanguageProfileBuilder - Class in com.optimaize.langdetect.profiles
Builder for LanguageProfile.
LanguageProfileBuilder(LdLocale) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
LanguageProfileBuilder(String) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
Deprecated.
LanguageProfileImpl - Class in com.optimaize.langdetect.profiles
This class is immutable.
LanguageProfileImpl(LdLocale, Map<Integer, Map<String, Integer>>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl
Use the builder.
LanguageProfileImpl.Stats - Class in com.optimaize.langdetect.profiles
 
LanguageProfileReader - Class in com.optimaize.langdetect.profiles
LanguageProfileReader() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileReader
 
languageProfiles - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
LanguageProfileWriter - Class in com.optimaize.langdetect.profiles
Writes a LanguageProfile to an output stream or file.
LanguageProfileWriter() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileWriter
 
languages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
langWeightingMap - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
LATIN1_EXCLUDED - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
LdLocale - Class in com.optimaize.langdetect.i18n
A language-detector implementation of a Locale, similar to the java.util.Locale.
LdLocale(String, Optional<String>, Optional<String>) - Constructor for class com.optimaize.langdetect.i18n.LdLocale
 
length() - Method in class com.optimaize.langdetect.text.TextObject
 
LESS_FREQ_RATIO - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Explanation by example: If the most frequent n-gram occurs 1 mio times, then 1'000'000 / this (100'000) = 10.
load(String, File) - Static method in class com.optimaize.langdetect.cybozu.GenProfile
Load Wikipedia abstract database file and generate its language profile
locale - Variable in class com.optimaize.langdetect.DetectedLanguage
 
locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
logger - Static variable in class com.optimaize.langdetect.cybozu.GenProfile
 
logger - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
looksLikeGeoCode3166_1(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 
looksLikeGeoCodeNumeric(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 
looksLikeLanguageProfileFile(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
looksLikeLanguageProfileName(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
looksLikeScriptCode(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

M

MAIL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
main(String[]) - Static method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Command Line Interface
main(String[]) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
 
makeDetector() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Using all language profiles from the given directory.
makeInternalPrioMap(Map<LdLocale, Double>, List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
makePathForClassLoader(String, String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
makeProfileFileName(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
makeStats(Map<Integer, Map<String, Integer>>) - Static method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
maxGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObject
 
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactory
 
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
maxTextLength(int) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
Messages - Class in com.optimaize.langdetect.cybozu.util
This is Messages class generated by Eclipse automatically.
Messages() - Constructor for class com.optimaize.langdetect.cybozu.util.Messages
 
minGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
minimalConfidence(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
LanguageDetector.detect(java.lang.CharSequence) returns a language if the best detected language has at least this probability.
minimalFrequency - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
minimalFrequency(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
MINIMUM_FREQ - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
n-grams that occur less than this often can be removed using omitLessFreq().
MultiTextFilter - Class in com.optimaize.langdetect.text
Groups multiple TextFilters as one and runs them in the given order.
MultiTextFilter(List<TextFilter>) - Constructor for class com.optimaize.langdetect.text.MultiTextFilter
 

N

N_GRAM - Static variable in class com.optimaize.langdetect.cybozu.util.NGram
ngrams are created from 1gram to this amount, currently 2grams and 3grams.
N_TRIAL - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
N_WORDS_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
name - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
The language name (identifier).
NAME_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
NGram - Class in com.optimaize.langdetect.cybozu.util
TODO document.
NGram() - Constructor for class com.optimaize.langdetect.cybozu.util.NGram
 
ngramExtractor - Static variable in class com.optimaize.langdetect.cybozu.util.Util
 
ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
ngramExtractor - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
ngramExtractor(NgramExtractor) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
NgramExtractor - Class in com.optimaize.langdetect.ngram
Class for extracting n-grams out of a text.
NgramExtractor(List<Integer>, NgramFilter, Character) - Constructor for class com.optimaize.langdetect.ngram.NgramExtractor
 
NgramExtractors - Class in com.optimaize.langdetect.ngram
Provides easy access to commonly used NgramExtractor configs.
NgramExtractors() - Constructor for class com.optimaize.langdetect.ngram.NgramExtractors
 
NgramFilter - Interface in com.optimaize.langdetect.ngram
Filters out some undesired n-grams.
ngramFrequencyData - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
NgramFrequencyData - Class in com.optimaize.langdetect
Contains frequency information for n-grams coming from multiple LanguageProfiles.
NgramFrequencyData(Map<String, double[]>, List<LdLocale>) - Constructor for class com.optimaize.langdetect.NgramFrequencyData
 
ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
normalize(char) - Static method in class com.optimaize.langdetect.cybozu.util.CharNormalizer
Character Normalization (and exclusion).
normalizeProb(double[]) - Static method in class com.optimaize.langdetect.cybozu.util.Util
normalize probabilities and check convergence by the maximum probability
numOccurrences - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
nWords - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Tells how many occurrences of n-grams exist per gram length.

O

OldLangProfileConverter - Class in com.optimaize.langdetect.profiles
Converts an old LangProfile to a new LanguageProfile.
OldLangProfileConverter() - Constructor for class com.optimaize.langdetect.profiles.OldLangProfileConverter
 
OldNgramExtractor - Class in com.optimaize.langdetect.ngram
Deprecated.
OldNgramExtractor() - Constructor for class com.optimaize.langdetect.ngram.OldNgramExtractor
Deprecated.
 
OldNgramExtractor.Filter - Interface in com.optimaize.langdetect.ngram
Deprecated.
 
omitLessFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Removes ngrams that occur fewer times than MINIMUM_FREQ to get rid of rare ngrams.
opt_with_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
for Command line easy parser
opt_without_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

P

parse(String[]) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Command line easy parser
prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
prefixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
To weight n-grams that are on the left border of a word differently from n-grams in the middle of words, assign a value here.
priorMap - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
User-defined language priorities, in the same order as langlist.
probability - Variable in class com.optimaize.langdetect.DetectedLanguage
 
PROBABILITY_SORTING_COMPARATOR - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
probabilityThreshold(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
LanguageDetector.getProbabilities(java.lang.CharSequence) does not return languages with less probability than this.
PROFILES_DIR - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
 

R

read(File) - Method in class com.optimaize.langdetect.frma.LangProfileReader
Reads a LangProfile from a File in UTF-8.
read(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads a LanguageProfile from a File in UTF-8.
read(InputStream) - Method in class com.optimaize.langdetect.frma.LangProfileReader
Reads a LangProfile from an InputStream in UTF-8.
read(InputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads a LanguageProfile from an InputStream in UTF-8.
read(ClassLoader, String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Load profiles from the classpath in a specific directory.
read(String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Same as LanguageProfileReader.read(ClassLoader, String, java.util.Collection) using the class loader of this class.
read(Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Same as LanguageProfileReader.read(ClassLoader, String, java.util.Collection) using the class loader of this class, and the default profiles directory of this library.
readAll() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Deprecated.
renamed to readAllBuiltIn()
readAll(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Loads all profiles from the specified directory.
readAllBuiltIn() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads all built-in language profiles from the "languages" folder (shipped with the jar).
readBuiltIn(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
readBuiltIn(Collection<LdLocale>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
readFilesFromClassPathFolder(String) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
 
region - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
remove(CharSequence, Set<Character.UnicodeScript>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
RemoveMinorityScriptsTextFilter - Class in com.optimaize.langdetect.text
Removes text written in scripts that are not the dominant script of the text.
RemoveMinorityScriptsTextFilter(double) - Constructor for class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
removeNgramsWithLessFrequency() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
requireParamString(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
RESOURCE_BUNDLE - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
 

S

script - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
searchFile(File, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
File search (easy glob)
seed - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
seed - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
seed(long) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
seed(Optional<Long>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
serialVersionUID - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setFreq(Map<String, Integer>) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setName(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setNWords(int[]) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setTag(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
shortTextAlgorithm(int) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Defaults to 0, which means don't use this feature.
shortTextLanguages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
sortProbability(double[]) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
Returns the detected languages sorted by probabilities descending.
standard() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
The new standard n-gram algorithm.
STANDARD - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
 
StandardNgramFilter - Class in com.optimaize.langdetect.ngram
Filters what is generally not desired.
StandardNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.StandardNgramFilter
 
stats - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
Stats(Map<Integer, Long>, Map<Integer, Long>, Map<Integer, Long>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
 
stringBuilder - Variable in class com.optimaize.langdetect.text.TextObject
 
subSequence(int, int) - Method in class com.optimaize.langdetect.text.TextObject
 
suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
suffixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Defaults to 1.0, which means don't use this feature.

T

tag_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
TagExtractor - Class in com.optimaize.langdetect.cybozu.util
TagExtractor is a class which extracts inner texts of specified tag.
TagExtractor(String, int) - Constructor for class com.optimaize.langdetect.cybozu.util.TagExtractor
 
target_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
textFilter - Variable in class com.optimaize.langdetect.text.TextObject
 
textFilter - Variable in class com.optimaize.langdetect.text.TextObjectFactory
 
TextFilter - Interface in com.optimaize.langdetect.text
Allows to filter content from a text to be ignored for the n-gram analysis.
textFilters - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
TextObject - Class in com.optimaize.langdetect.text
A convenient text object implementing CharSequence and Appendable.
TextObject(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObject
 
textObjectFactory - Static variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
textObjectFactory - Static variable in class com.optimaize.langdetect.frma.GenProfile
 
TextObjectFactory - Class in com.optimaize.langdetect.text
Factory for TextObjects.
TextObjectFactory(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObjectFactory
 
TextObjectFactoryBuilder - Class in com.optimaize.langdetect.text
Builder for TextObjectFactory.
TextObjectFactoryBuilder() - Constructor for class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
textPadding - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
textPadding(char) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
To ensure having border grams, this character is added to the left and right of the text.
threshold - Variable in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
threshold_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
toString() - Method in class com.optimaize.langdetect.DetectedLanguage
 
toString() - Method in class com.optimaize.langdetect.i18n.LdLocale
The output of this can be fed to the fromString() method.
toString() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
toString() - Method in class com.optimaize.langdetect.text.TextObject
 

U

unicodeEncode(String) - Static method in class com.optimaize.langdetect.cybozu.util.Util
unicode encoding (for verbose mode)
updateLangProb(double[], String, int, double) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
update language probabilities with N-gram string(N=1,2,3)
URL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
UrlTextFilter - Class in com.optimaize.langdetect.text
Removes URLs and email addresses from the text.
UrlTextFilter() - Constructor for class com.optimaize.langdetect.text.UrlTextFilter
 
use(String) - Method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
use(String) - Method in interface com.optimaize.langdetect.ngram.NgramFilter
 
use(String) - Method in interface com.optimaize.langdetect.ngram.OldNgramExtractor.Filter
Deprecated.
Allows to skip some n-grams.
use(String) - Method in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
Util - Class in com.optimaize.langdetect.cybozu.util
A place for sharing code.
Util() - Constructor for class com.optimaize.langdetect.cybozu.util.Util
 

V

values - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

W

withProfile(LanguageProfile) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
withProfiles(Iterable<LanguageProfile>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
withTextFilter(TextFilter) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
Adds the given TextFilter to be run on TextObject.append(java.io.Reader) methods.
wordLangProbMap - Variable in class com.optimaize.langdetect.NgramFrequencyData
Key = ngram Value = array with probabilities per loaded language, in the same order as langlist.
wordProbToString(double[], List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
write(LanguageProfile, OutputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
Writes a LanguageProfile to an OutputStream in UTF-8.
write(LangProfile, OutputStream) - Method in class com.optimaize.langdetect.frma.LangProfileWriter
Writes a LangProfile to an OutputStream in UTF-8.
writeToDirectory(LanguageProfile, File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
Writes a LanguageProfile to a folder using the language name as the file name.

_

_extractCounted(CharSequence, int, int, Map<String, Integer>) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
A B C D E F G H I L M N O P R S T U V W _ 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form