spacy-rt — spaCy's English pipeline, in Rust, in your browser

spaCy's English pipeline, rebuilt in Rust, running in this tab.

Tokenizer, POS tagger, dependency parser, lemmatizer, named-entity recognition, and 300-dimension word vectors — a from-scratch reimplementation of en_core_web_md, compiled to WebAssembly. The whole model is baked into this one HTML file. Turn your wifi off; everything below still runs.

LIVE · measured on your machine, this instant

Last parse

—ms

Throughput

—tok/s

Tokens

—

Round trips

0network

Entities & syntax

The full parse: named entities highlighted inline, the dependency tree drawn as arcs, and every token's lemma, part-of-speech, fine-grained tag and head. Edit the text — it re-parses as you type.

re-parses live while you type

Word similarity

Cosine similarity between two words' 300-d vectors — straight from the embedding table baked into this file. The numbers are identical to spaCy's en_core_web_md: king/queen lands at 0.38 here exactly as it does there.

Text → embedding

Averages the token vectors into a single document embedding (doc.vector) — the vector you'd feed a clustering or semantic-search index. Each cell below is one of the 300 dimensions.

Lemmatizer

Reduces every token to its dictionary base form — running→run, better→well, mice→mouse — using the same rules and exception tables as spaCy.