2 min readfrom Data Science

I built a full-text search CLI for all your databases and docs

Hi r/datascience 👋

I've spent a lot of time digging through databases & docs, and one thing that keeps slowing me (and my coding agents) is not being able to search across everything all at once.

So I built bm25-cli. It's a zero-config CLI that lets you run full-text search across your database schemas, tables, columns, keys, docs, comments, and metadata — in one command

So, how does it work?

Just point it at a source and search:

$ bm25 "payment handling refund" ./db_docs $ bm25 "payment handling refund" mysql://user@localhost/mydb $ bm25 "payment handling refund" postgres://user@localhost/mydb 

Mix and match:

$ bm25 "join error" postgres://user@localhost/mydb mysql://user@localhost/mydb ./mydocs 

No config files. No servers. No setup.

Works with everything

Source Example
Directory ./src, ., /home/user/project
Glob "**/*.md", "src/**/*.py"
PostgreSQL postgres://user@host/mydb
MySQL mysql://user@host/mydb
SQLite sqlite:./local.db
Website https://ngrok.com/docs/api

Why I find it useful

  • One command for everything — files, schemas, and docs in a single search
  • BM25 ranking — same algorithm that powers Elasticsearch and Lucene
  • Databases too — searches table names, columns, types, foreign keys, and comments
  • Fast after first run — indexes are cached in ~/.bm25/ and reused

If you're working with databases + coding agents, i'd love to hear what you think.

---

GitHub: https://github.com/statespace-tech/bm25

A ⭐ on GitHub really helps with visibility!

submitted by /u/Durovilla
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#no-code spreadsheet solutions
#rows.com
#financial modeling with spreadsheets
#natural language processing for spreadsheets
#generative AI for data analysis
#Excel alternatives for data analysis
#row zero
#real-time data collaboration
#real-time collaboration
#spreadsheet API integration
#full-text search
#CLI
#databases
#BM25 ranking
#schemas
#tables
#columns
#zero-config
#metadata
#MySQL