Smart import
CSV, Excel, or PDF. Preview columns or pages, choose your source, and start processing.
SA7BY® is an NLP lab for tabular corpora. Upload an Excel or CSV file, choose the column that holds your text, and tokenization, lemmatization, POS tagging, and dependency parsing run in the background. Then explore semantic analysis with WOLF and FastText.
Import a CSV, Excel, XML, or PDF file. Within minutes, every word is annotated and every sentence is available in context. Explore co-occurrences, n-grams, word proximity, and semantic relations. Export your results to CSV at any time.
Built for researchers in linguistics, textometry, and digital humanities.
All words in your corpus sorted by frequency or alphabetically. Multi-POS filter, interactive word cloud, and CSV export.
Every occurrence of a word with its left and right context. KWIC sort (L1, L2, R1, R2), configurable context size, and reading mode.
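To make the KWIC sort concrete, here is a minimal Python sketch of a concordance sorted by a context slot. It is an illustration only, not SA7BY's implementation: the function names `kwic` and `sort_by_position` are hypothetical, and the sketch assumes L1 means the first word to the left of the keyword and R1 the first word to the right.

```python
def kwic(tokens, keyword, context=3):
    """Return (left, node, right) triples for every occurrence of keyword."""
    rows = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            left = tokens[max(0, i - context):i]
            right = tokens[i + 1:i + 1 + context]
            rows.append((left, tok, right))
    return rows


def sort_by_position(rows, position):
    """Sort rows by a context slot such as 'L1' or 'R2' (assumed naming)."""
    side, offset = position[0], int(position[1:])

    def key(row):
        left, _, right = row
        if side == "L":
            # L1 is the last word of the left context, L2 the one before it.
            return left[-offset].lower() if len(left) >= offset else ""
        # R1 is the first word of the right context, R2 the next one.
        return right[offset - 1].lower() if len(right) >= offset else ""

    return sorted(rows, key=key)


tokens = "the cat sat on the mat while the dog sat by the door".split()
rows = sort_by_position(kwic(tokens, "sat", context=2), "R1")
for left, node, right in rows:
    print(" ".join(left).rjust(12), f"[{node}]", " ".join(right))
```

Changing `context` widens or narrows the window on each side, which corresponds to the configurable context size mentioned above.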
Filter your corpus by author, date, genre, or any column from the original file. Multi-select and cascading filters persist across pages.
Discover words that frequently appear together. Five statistical measures and a configurable span let you analyze lexical associations.
Identify recurring expressions and word sequences, from 2 to 5 words. Toggle between lemma and surface form.
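As a rough illustration of what the lemma/surface toggle changes, the sketch below counts recurring 2- to 5-word sequences over either form. The `(surface, lemma)` pair format and the function names are assumptions made for the example, not SA7BY's data model.

```python
from collections import Counter


def ngrams(tokens, n):
    """All contiguous sequences of n tokens."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def recurring_ngrams(tagged, use_lemma=True, n_min=2, n_max=5, min_count=2):
    """Count n-grams over lemmas or surface forms; keep those seen min_count+ times.

    `tagged` is a list of (surface, lemma) pairs (hypothetical input format).
    """
    tokens = [lemma if use_lemma else surface for surface, lemma in tagged]
    counts = Counter()
    for n in range(n_min, n_max + 1):
        counts.update(ngrams(tokens, n))
    return {gram: c for gram, c in counts.items() if c >= min_count}


tagged = [
    ("The", "the"), ("cats", "cat"), ("sleep", "sleep"),
    ("the", "the"), ("cat", "cat"), ("sleeps", "sleep"),
]
by_lemma = recurring_ngrams(tagged, use_lemma=True)
by_surface = recurring_ngrams(tagged, use_lemma=False)
print(by_lemma)
print(by_surface)
```

Counting on lemmas groups inflected variants such as "cats sleep" and "cat sleeps" into one expression, while counting on surface forms keeps them apart.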
Find sentences where two words appear near each other. Control distance and order, and filter by grammatical category.
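A proximity search of this kind can be sketched in a few lines of Python. The function `near` and its parameters are hypothetical, and distance is assumed to be counted in token positions within one sentence. Filtering by grammatical category is left out to keep the example short.

```python
def near(sentence_tokens, a, b, max_distance=5, ordered=False):
    """True if a and b occur within max_distance tokens of each other.

    With ordered=True, a must come before b.
    """
    positions_a = [i for i, t in enumerate(sentence_tokens) if t == a]
    positions_b = [i for i, t in enumerate(sentence_tokens) if t == b]
    for i in positions_a:
        for j in positions_b:
            gap = j - i
            if ordered and gap <= 0:
                continue  # b does not follow a
            if 0 < abs(gap) <= max_distance:
                return True
    return False


sentence = "the minister spoke about the reform yesterday".split()
print(near(sentence, "minister", "reform", max_distance=4))
print(near(sentence, "reform", "minister", max_distance=4, ordered=True))
```

Running `near` over every sentence of a corpus and keeping the ones that return `True` gives the list of matching sentences.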
Automatically detect persons, locations, and organizations in your corpus. Filter by type and view sentences.
Build search patterns from POS tags, lemmas, and gaps, then find every matching sequence in your corpus.
Add your own columns to the concordance table to classify each occurrence. Free text or dropdown, included in CSV export.
Discover words specific to your corpus compared to general French. Keyness score and effect size.
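The page does not say which keyness statistic or effect size is used. As one plausible reading, the sketch below uses two measures common in corpus linguistics: the log-likelihood score (G2, in its two-cell form) for keyness and the log ratio for effect size. The function names and the smoothing constant are assumptions made for the example.

```python
import math


def log_likelihood(freq_study, size_study, freq_ref, size_ref):
    """Two-cell log-likelihood (G2) of a word in a study vs. a reference corpus."""
    total = freq_study + freq_ref
    expected_study = size_study * total / (size_study + size_ref)
    expected_ref = size_ref * total / (size_study + size_ref)
    g2 = 0.0
    for observed, expected in ((freq_study, expected_study), (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


def log_ratio(freq_study, size_study, freq_ref, size_ref, smoothing=0.5):
    """Binary log of the ratio of relative frequencies (effect size).

    A zero frequency is replaced by `smoothing` to avoid division by zero
    (an assumed convention, not taken from the page).
    """
    rel_study = (freq_study or smoothing) / size_study
    rel_ref = (freq_ref or smoothing) / size_ref
    return math.log2(rel_study / rel_ref)


# Hypothetical counts: a word seen 40 times in a 10,000-word corpus
# and 10 times in a 10,000-word reference sample.
print(log_likelihood(40, 10_000, 10, 10_000))
print(log_ratio(40, 10_000, 10, 10_000))
```

The keyness score indicates how confident one can be that the difference is not due to chance, while the effect size indicates how large the difference is. A log ratio of 2 means the word is four times as frequent, relatively, in the study corpus.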
Drop your file, then choose the text column or the PDF pages. Linguistic processing starts automatically.
Browse the lexicon, concordances, n-grams, and co-occurrences. Filter by grammatical category and export to CSV.
Search for nearby words, explore lexical associations, and run semantic analysis with an interactive graph.
Questions? Check the FAQ or the documentation.