1
0
mirror of https://github.com/osmarks/meme-search-engine.git synced 2025-02-23 06:20:04 +00:00

80 Commits

Author SHA1 Message Date
osmarks
3e568ff613 restore stashed code 2025-01-18 19:07:16 +00:00
osmarks
fcd28a5ede add missing files 2025-01-18 17:09:44 +00:00
osmarks
a5a6e960bb query code 2025-01-18 17:09:00 +00:00
osmarks
63caba2746 integrate rating model 2025-01-18 11:29:03 +00:00
osmarks
d3fcedda09 cleanup 2025-01-18 07:25:21 +00:00
osmarks
0a542ef579 repurpose meme rater 2025-01-18 07:19:08 +00:00
osmarks
163dceca4b enforce r, oops 2025-01-16 21:36:15 +00:00
osmarks
9291eacead remove rogue debug code 2025-01-16 21:28:01 +00:00
osmarks
f4376f62ed RobustVamana algorithm for big index run 2025-01-16 21:10:12 +00:00
osmarks
d341a8c243 accursed k-means balance thing 2025-01-16 16:02:59 +00:00
osmarks
5ab91aa17e CLI switch for L 2025-01-14 08:31:03 +00:00
osmarks
2cebce1b73 new PQ training code 2025-01-14 07:46:09 +00:00
osmarks
4dd97631df fix entire index algorithm (very silly bug) 2025-01-12 19:48:53 +00:00
osmarks
0a196694b1 minor tweaks 2025-01-11 12:19:10 +00:00
osmarks
44ffe80102 fix gitignore 2025-01-11 07:54:23 +00:00
osmarks
087419f470 remove vestigal r_cap 2025-01-11 07:36:46 +00:00
osmarks
8ce51bcb56 correct DiskANN algorithm (silly bug with greedy search) 2025-01-11 07:35:04 +00:00
osmarks
e9ee563381 tweak some parameters 2025-01-03 09:22:39 +00:00
osmarks
4d08ec370b oops, dependencies 2025-01-02 21:04:50 +00:00
osmarks
265502f141 tweak index build, this had better work, aaa 2025-01-02 21:04:26 +00:00
osmarks
f1283137d6 release WIP DiskANN index orchestration code 2025-01-01 14:40:24 +00:00
osmarks
35df1201e2 fix overflow bug 2025-01-01 14:40:00 +00:00
osmarks
e0cf65204b release early draft of index code 2024-12-31 23:05:48 +00:00
osmarks
512b776e10 use slightly worse image scaling 2024-11-13 18:31:18 +00:00
osmarks
8097ce8d91 improve dump processing and misc performance fixes 2024-11-11 19:43:07 +00:00
osmarks
c277b49dc1 fix resumption, oops 2024-11-07 20:43:26 +00:00
osmarks
b9bb629e6f performance improvements 2024-11-07 16:52:58 +00:00
osmarks
7fa14d45ae improve observability and fix up Reddit dump for full-scale run 2024-11-02 19:38:05 +00:00
1d0ff95955 Sparse autoencoder testing 2024-10-05 17:22:44 +01:00
fc6d0c9409 Fix crawler (rate limit changes) 2024-07-15 18:48:55 +01:00
43ff9215fb File metadata storage.
The backend now knows how big images are, so the frontend can size images correctly before they're loaded.
This should significantly improve the UI on slow connections.
Also fix bug where videos weren't erased from the index properly.
2024-06-26 20:02:12 +01:00
1ab254ff1d Adjust index storage for memory efficiency and fix SQLite interface type confusion 2024-06-25 08:23:30 +01:00
e7adf738f6 Fix typo, in the sense of application-killing bug. 2024-05-31 00:35:16 +01:00
747058e254 misc fixes
- thumbnails/OCR off was broken
- problematic video files caused segfaults (I blame ffmpeg for this)
2024-05-30 19:05:54 +01:00
3257521068 Video search 2024-05-30 15:58:31 +01:00
74d91d52e5 probably I should do better testing 2024-05-29 21:25:50 +01:00
5eae8674ce video parsing basics 2024-05-28 22:28:41 +01:00
129b769a56 hackily patch horrifyingly nondeterministic-but-fast image encoder in 2024-05-27 20:21:44 +01:00
d4e136b6a7 AITemplate builds of the image encoder work, at great personal cost 2024-05-27 19:05:25 +01:00
a8329e43fc more progress on Reddit 2024-05-27 15:22:28 +01:00
f8d68d9d54 WIP Reddit dump loader 2024-05-24 17:47:18 +01:00
978aadda6a Improved UI for sliders 2024-05-22 20:26:23 +01:00
d8c147df52 Predefined embedding modes in search 2024-05-22 20:17:13 +01:00
14387a61a3 refactor configuration 2024-05-22 19:02:34 +01:00
ffc3d648a6 basic monitoring implementation 2024-05-22 18:49:32 +01:00
ce590298a7 concurrent index queries and fix database typo yet again 2024-05-22 18:25:50 +01:00
349fe802f7 meme interpretability 2024-05-22 16:18:45 +01:00
bd426a30ba Port meme acquisition pipeline to new API, database
Also fix a really stupid oversight in crawling code.
2024-05-22 15:43:56 +01:00
30b1b72712 I really should test database queries better 2024-05-22 14:35:29 +01:00
9455438bab frontend fixes 2024-05-21 20:17:23 +01:00