Trigrams for 500+ languages.
- What is this?
- When should I use this?
- Install
- Use
- API
- Data
- Compatibility
- Contribute
- Security
- License
This package exposes all trigrams for natural languages. Based on the most translated copyright-free document on this planet: UDHR.
When you are dealing with natural language detection.
This package is ESM only. In Node.js (version 18+), install with npm:
npm install trigramsIn Deno with esm.sh:
import {min, top} from 'https://esm.sh/trigrams@6'In browsers with esm.sh:
<script type="module"> import {min, top} from 'https://esm.sh/trigrams@6?bundle' </script>import {min, top} from 'trigrams' console.log((await min()).nld) console.log((await top()).pam)Yields:
[ // 300 top trigrams. ' ar', 'eer', 'tij', // … 'de ', 'an ', 'en ' // Most common trigram. ]{ // 300 top trigrams. 'isa': 6, 'upa': 6, 'i k': 6, // … 'ang': 273, 'ing': 282, 'ng ': 572 // Most common trigram with how often it was found. }This package exports the identifiers min and top. It exports no TypeScript types. There is no default export.
Get top trigrams.
Returns a promise resolving to arrays containing the top 300 trigrams sorted from least occurring to most occurring (Promise<Record<string, Array<string>>>).
Get top trigrams to occurrence counts.
Returns a promise resolving to an object mapping UDHR in Unicode codes to objects mapping the top 300 trigrams to occurrence counts (Promise<Record<string, Record<string, number>>>).
The trigrams are based on the unicode versions of the universal declaration of human rights.
The files are created from all paragraphs made available by wooorm/udhr and do not include headings and such.
Before creating trigrams,
- the unicode characters from
\u0021to\u0040(both including) are removed - one or more white space characters (
\s+) are replaced with a single space - alphabetic characters are lower cased (
[A-Z])
Additionally, the input is padded with two spaces on both sides.
This package is at least compatible with all maintained versions of Node.js. As of now, that is Node.js 18+. It also works in Deno and modern browsers.
Yes please! See How to Contribute to Open Source.
This package is safe.