mirror of
https://github.com/osmarks/meme-search-engine.git
synced 2024-11-11 06:19:54 +00:00
osmarks
7cb42e028f
I decided I wanted to integrate the experimental OCR thing better, so I rewrote it in Go and also integrated the thumbnailer. However, Go is a bad language and I only used it out of spite. It turned out to have a very hard-to-fix memory leak due to some unclear interaction between libvips and both sets of bindings I tried, so I had Claude-3 transpile it to Rust, then spent a while fixing the several mistakes it made and making tweaks. The new Rust version works, although I still need to actually do something with the OCR data and make the index queryable concurrently.
63 lines
1.1 KiB
JSON
{
  "db_name": "SQLite",
  "query": "SELECT * FROM files WHERE filename = ?",
  "describe": {
    "columns": [
      {
        "name": "filename",
        "ordinal": 0,
        "type_info": "Text"
      },
      {
        "name": "embedding_time",
        "ordinal": 1,
        "type_info": "Int64"
      },
      {
        "name": "ocr_time",
        "ordinal": 2,
        "type_info": "Int64"
      },
      {
        "name": "thumbnail_time",
        "ordinal": 3,
        "type_info": "Int64"
      },
      {
        "name": "embedding",
        "ordinal": 4,
        "type_info": "Blob"
      },
      {
        "name": "ocr",
        "ordinal": 5,
        "type_info": "Text"
      },
      {
        "name": "raw_ocr_segments",
        "ordinal": 6,
        "type_info": "Blob"
      },
      {
        "name": "thumbnails",
        "ordinal": 7,
        "type_info": "Blob"
      }
    ],
    "parameters": {
      "Right": 1
    },
    "nullable": [
      false,
      true,
      true,
      true,
      true,
      true,
      true,
      true
    ]
  },
  "hash": "ec2da4ab11ede7a9a468ff3a50c55e0f6503fddd369f2c3031f39c0759bb97a0"
}
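The column list above can be read as a row type: `ordinal` gives the field order, `type_info` the storage type, and each `true` in `nullable` marks a column that may be NULL. The following is a minimal sketch of how that might look in Rust, using only the standard library. The struct name `FileRecord` and the helper `pending_steps` are hypothetical and do not come from the repository. Treating a NULL `*_time` column as "step not yet run" is also an assumption made for illustration.

```rust
/// Hypothetical row type for `SELECT * FROM files WHERE filename = ?`.
/// Field order follows the `ordinal` values in the metadata; every column
/// marked `true` in `nullable` is wrapped in `Option`.
#[derive(Debug, Clone, PartialEq, Default)]
pub struct FileRecord {
    pub filename: String,                  // Text, not nullable
    pub embedding_time: Option<i64>,       // Int64
    pub ocr_time: Option<i64>,             // Int64
    pub thumbnail_time: Option<i64>,       // Int64
    pub embedding: Option<Vec<u8>>,        // Blob
    pub ocr: Option<String>,               // Text
    pub raw_ocr_segments: Option<Vec<u8>>, // Blob
    pub thumbnails: Option<Vec<u8>>,       // Blob
}

impl FileRecord {
    /// Lists the steps whose timestamp column is still NULL.
    /// Assumption: a NULL timestamp means the step has not run yet.
    pub fn pending_steps(&self) -> Vec<&'static str> {
        let mut steps = Vec::new();
        if self.embedding_time.is_none() {
            steps.push("embedding");
        }
        if self.ocr_time.is_none() {
            steps.push("ocr");
        }
        if self.thumbnail_time.is_none() {
            steps.push("thumbnail");
        }
        steps
    }
}
```

Only `filename` is a plain `String` because it is the only column with `false` in the `nullable` array; a freshly inserted row with just a filename would report all three steps as pending under the assumption stated above.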