Language identification Stay organized with collections Save and categorize content based on your preferences.
AI-generated Key Takeaways
ML Kit's language identification API allows you to determine the language of a text string, which is particularly helpful when dealing with user-generated content that often lacks language information.
It supports a wide range of languages, including romanized versions of Arabic, Bulgarian, Greek, Hindi, Japanese, Russian, and Chinese text.
The API can provide a simple language identification or a confidence distribution across multiple languages for a given text.
With ML Kit's on-device language identification API, you can determine the language of a string of text.
Language identification can be useful when working with user-provided text, which often doesn't come with any language information.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-10-29 UTC."],[],["ML Kit's on-device API identifies the language of text strings, supporting over one hundred languages, including romanized versions of Arabic, Bulgarian, Greek, Hindi, Japanese, Russian, and Chinese. It's designed for user-provided text that lacks language information. The API outputs language codes (e.g., `en` for English) or `und` for undetermined. For ambiguous cases, it provides a confidence distribution across multiple languages (e.g., `en` (0.52), `fr` (0.44)).\n"]]