hotchpotch

SQLite FTS5 extension for fast Japanese full-text search with 🛥 Vaporetto

100% credibility
Found Apr 28, 2026 at 11 stars.
AI Analysis
Rust
AI Summary

This project enhances SQLite databases with a fast Japanese tokenizer for full-text search, enabling precise word-based queries, rankings, and highlights on Japanese content.

How It Works

1. 🔍 Find a Japanese search helper

You discover a tool that makes searching Japanese text in a simple database fast and accurate by breaking sentences into searchable words.

2. 📥 Download a ready-made file

Pick and download the file that matches your computer from the list of prepared packages; some even come with built-in Japanese word knowledge.

3. 🗄️ Open your database

Launch the program that holds your collection of Japanese texts or notes.

4. Wake up the Japanese smarts

Add the downloaded helper to your database with one instruction, unlocking word-aware searching for Japanese.

5. 📝 Build a search space

Set up a special zone in your database designed for quick Japanese text lookups.

6. Load your writings

Drop your Japanese documents, articles, or notes into the search zone.

7. 🔎 Hunt for treasures

Type natural Japanese phrases to search and get ranked results with the matches highlighted.

🎉 Master Japanese searches

Enjoy fast, accurate discovery of information in your Japanese texts every time you search.
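Steps 5 through 7 can be sketched with Python's standard `sqlite3` module. This is a minimal illustration, not the project's own example: the table name `docs`, the column `body`, and the helpers `build_index` and `search` are hypothetical. The built-in `unicode61` tokenizer stands in for the extension so the sketch runs anywhere FTS5 is compiled in, which is also why the sample sentences are pre-split with spaces. With the extension loaded, you would presumably pass `tokenizer="vaporetto"` and store ordinary unsegmented Japanese text.

```python
import sqlite3


def build_index(conn, docs, tokenizer="unicode61"):
    """Create an FTS5 table (step 5) and load documents into it (step 6)."""
    # The tokenizer name is interpolated into DDL, so pass only a fixed,
    # trusted name here ("unicode61" is a stand-in for "vaporetto").
    conn.execute(
        f"CREATE VIRTUAL TABLE docs USING fts5(body, tokenize='{tokenizer}')"
    )
    conn.executemany("INSERT INTO docs(body) VALUES (?)", [(d,) for d in docs])


def search(conn, query):
    """Return (highlighted_body, score) rows, best match first (step 7)."""
    sql = """
        SELECT highlight(docs, 0, '[', ']'), bm25(docs)
        FROM docs
        WHERE docs MATCH ?
        ORDER BY bm25(docs)
    """
    return conn.execute(sql, (query,)).fetchall()


conn = sqlite3.connect(":memory:")
# Pre-segmented sample text, an assumption made only for the fallback tokenizer.
build_index(conn, ["東京 タワー に 行く", "京都 の 寺 を 見る", "東京 駅 で 会う"])
results = search(conn, "東京")
for text, score in results:
    print(text, score)
```

FTS5's `bm25()` returns lower (more negative) values for better matches, so a plain ascending `ORDER BY` puts the best result first.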

AI-Generated Review

What is sqlite-vaporetto?

sqlite-vaporetto is a loadable SQLite FTS5 extension, written in Rust, that brings fast Japanese full-text search to standard SQLite databases using the Vaporetto tokenizer. It segments Japanese text into words before indexing, so you can run MATCH queries, bm25 ranking, and highlighted snippets on Japanese content without leaving SQLite. Download a pre-built binary for Linux, macOS, or Windows from the GitHub releases, load it with `.load libsqlite_vaporetto.so`, and create FTS5 tables with `tokenize='vaporetto'`.
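Outside the `sqlite3` shell, the same load step can be done from application code. The sketch below uses Python's standard `sqlite3` module and assumes the binary sits at `./libsqlite_vaporetto.so`; that path, the table name `notes`, and the helpers `try_load_extension` and `pick_tokenizer` are illustrative, not part of the project. If the file is missing or the Python build lacks extension loading, it falls back to the built-in `unicode61` tokenizer, which does not segment Japanese text into words.

```python
import sqlite3


def try_load_extension(conn, path):
    """Try to load a SQLite loadable extension; return True on success."""
    try:
        conn.enable_load_extension(True)
        try:
            conn.load_extension(path)
        finally:
            # Turn extension loading back off once we are done.
            conn.enable_load_extension(False)
    except (AttributeError, sqlite3.Error):
        # AttributeError: this Python build has no extension-loading support.
        # sqlite3.Error: the file is missing or could not be loaded.
        return False
    return True


def pick_tokenizer(conn, path):
    """Use 'vaporetto' when the extension loads, else the built-in 'unicode61'."""
    return "vaporetto" if try_load_extension(conn, path) else "unicode61"


conn = sqlite3.connect(":memory:")
# Hypothetical path: adjust to wherever the downloaded binary lives.
tokenizer = pick_tokenizer(conn, "./libsqlite_vaporetto.so")
conn.execute(f"CREATE VIRTUAL TABLE notes USING fts5(body, tokenize='{tokenizer}')")
print("tokenizer in use:", tokenizer)
```

Disabling extension loading again right after the call keeps the connection from loading further libraries by accident.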

Why is it gaining traction?

Unlike the generic SQLite FTS5 tokenizers, it segments Japanese text into words, which makes word-based searches accurate. It also offers fast tokenization of Japanese text, case-insensitive ASCII matching, and tag filtering for nouns or verbs, all in a compact extension. Helper SQL functions such as `vaporetto_and_query` and `vaporetto_or_query` build safe queries from user input, skipping whitespace tokens.
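To make "safe queries from user input" concrete, here is a rough Python sketch of the general technique, not the extension's actual implementation. It assumes the input has already been split into tokens, whereas the real helpers presumably run the Vaporetto tokenizer themselves. The names `quote_token` and `build_query` are hypothetical. Each token is wrapped as an FTS5 string literal with embedded double quotes doubled, and whitespace-only tokens are dropped.

```python
def quote_token(token):
    """Wrap a token as an FTS5 string literal, doubling embedded quotes."""
    return '"' + token.replace('"', '""') + '"'


def build_query(tokens, operator="AND"):
    """Join quoted tokens with AND or OR, skipping whitespace-only tokens."""
    if operator not in ("AND", "OR"):
        raise ValueError("operator must be 'AND' or 'OR'")
    quoted = [quote_token(t) for t in tokens if t.strip()]
    return f" {operator} ".join(quoted)


print(build_query(["東京", " ", "タワー"]))        # "東京" AND "タワー"
print(build_query(["東京", "京都"], operator="OR"))  # "東京" OR "京都"
```

Quoting every token keeps characters that FTS5 treats as query syntax, such as `*` or `:`, from being interpreted when they appear in user input.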

Who should use this?

Backend developers building Japanese document search into SQLite-powered apps, such as mobile databases or embedded systems, are the main audience. It also suits teams that need an SQLite FTS5 setup for blogs, wikis, or e-commerce catalogs with Japanese product names, and anyone who wants fast, portable full-text search in SQLite without an external search engine.

Verdict

Solid niche SQLite FTS5 extension for Japanese search, with clear docs, tests, and easy download from GitHub, but only 11 stars and a 1.0% credibility score signal an early-stage project. Try it if you need this exact feature; otherwise, stick to proven alternatives.
