Search results
461 packages found
Tiny JavaScript tokenizer.
A promise based streaming tokenizer
A tool set for CSS: fast detailed parser (CSS → AST), walker (AST traversal), generator (AST → CSS) and lexer (validation and matching) based on specs and browser implementations
Chevrotain is a high performance fault tolerant javascript parsing DSL for building recursive decent parsers
General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.
- natural language processing
- artifical intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
- View more
small commonmark compliant markdown parser with positional info and concrete tokens
React typeahead with Bootstrap styling
- auto complete
- auto suggest
- auto-complete
- auto-suggest
- autocomplete
- autosuggest
- bootstrap
- bootstrap tokenizer
- bootstrap typeahead
- bootstrap-tokenizer
- bootstrap-typeahead
- react
- react autocomplete
- react autosuggest
- View more
A pure JavaScript implementation of a BPE tokenizer (Encoder/Decoder) for GPT-2 / GPT-3 / GPT-4 and other OpenAI models
- BPE
- encoder
- decoder
- tokenizer
- GPT
- GPT-2
- GPT-3
- GPT-3.5
- GPT-4
- GPT-4o
- NLP
- Natural Language Processing
- Text Generation
- OpenAI
- View more
Tokenizes a string that represents a regular expression.
stream-json is the micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API. I
a lightweight no-dependency fork of transformers.js (only tokenizers)
- llama
- llama2
- llama3
- chatgpt
- mistral
- tokenizer
- transformers
- transformers.js
- gpt4
- gpt4o
- gpt3.5
- vicuna
- chatglm
- baichuan
Streams CSV files.
Parse PHP code from JS and returns its AST
Rich text and markdown tokenization made easy.
Fork of css-tree with all but animatable CSS properties removed
HTML and CSS lexer aimed at code with fatal errors, accepts mixed coding languages
Is given character suitable to be in an HTML attribute's name?
Does an HTML tag start at given position?