• Drop #537 (2024-09-30): It’s About Time

    Today’s Drop covers three resources centered on time: “Time as a Grid,” a visual representation of time using grids; Greptime, a database for managing time series data; and Primitive, an image processing tool that uses geometric shapes.


  • Drop #519 (2024-08-22): Cache Me If You Can

    Today’s Drop discusses creating a lightweight internet research setup by integrating SearXNG, Jina, and a small Go program to provide a caching layer for search results. SearXNG is a self-hostable metasearch engine, while Jina’s Reader endpoint converts URLs to Markdown. Through a combined CLI, we can preview search results in the terminal and access cached…


  • Drop #517 (2024-08-15): Thursdataday

    Three data-focused resources are featured in today’s Drop. “committed” offers lightweigth, remote monitoring of GitHub repository commits, “sq” enables query and manipulation across various structured data sources, and “CSVs Are Kinda Bad” proposes replacing CSV with DSV for improved data handling and parsing.


  • Drop #514 (2024-08-09): DuckDB Vector Search

    Today’s Drop showcases DuckDB’s nascent vector search superpowers by walking through an example of how to generate embeddings for CISA KEV vulnerabilities, performing similarity search, and having Ollama guess good titles for clustered groups of KEV vulns. Tis def a code block-heavy post. #DuckDB #RStats


  • Drop #510 (2024-08-02): SQLite Vector Search

    Today’s Drop covers the introduction of two SQLite extensions, sqlite-vec and sqlite-rembed, designed to facilitate vector search operations. These extensions provide fast and efficient vector search functionality, sponsored by the Mozilla Builders project, and enable easy integration of text embeddings from “remote” APIs (which included ollama/llamafile). With further development potential, these extensions aim to democratize…


  • Drop #509 (2024-08-01): SQL Shenanigans

    Today’s Drop has some unique ways to use SQL ops in unconventional data sources through various tools. “pickaxe” allows SQL operations for web scraping’ “fselect” & “fsql” enables running SQL over the filesystem; and, “shellfs” is a DuckDB extension allowing shell command execution within SQL queries.