1
0
mirror of https://github.com/osmarks/meme-search-engine.git synced 2025-04-05 10:16:58 +00:00

99 Commits

Author SHA1 Message Date
osmarks
a1bf23055e docker 2025-03-26 12:59:02 +00:00
osmarks
5121de6389 frontend customizations 2025-03-26 12:52:56 +00:00
osmarks
66056fbe3a fix title 2025-03-26 12:43:45 +00:00
osmarks
ea3375a7bf ??? 2025-03-26 12:33:26 +00:00
osmarks
9af7eed0f9 update requirements again 2025-03-26 12:26:53 +00:00
osmarks
ec6cde1ca4 update openclip 2025-03-26 12:13:27 +00:00
osmarks
a1e694ed6d oops 2025-03-26 12:02:42 +00:00
osmarks
2fc0389f79 remove requirements not needed for rust backend 2025-03-26 11:56:27 +00:00
osmarks
14ede42b96 broaden python package versions (maybe works) 2025-03-26 11:50:00 +00:00
osmarks
435a9812dc model path argument 2025-03-26 11:02:40 +00:00
osmarks
ec27deddbf fixed SAE export code 2025-01-31 17:12:25 +00:00
osmarks
899fbb7092 fix embedding-from-URL feature 2025-01-31 14:10:15 +00:00
osmarks
e57931d47f Multithread query server
While profiling suggests that most operations are cheap and IO-bound rather than CPU-bound, the GEMM for deduplication is pretty slow. As such, use multiple threads for higher throughput.
2025-01-31 13:47:47 +00:00
osmarks
5215822e39 mlock 2025-01-29 14:48:15 +00:00
osmarks
3b2664ca98 update README 2025-01-25 16:56:54 +00:00
osmarks
257486678d connect metrics correctly 2025-01-24 15:23:53 +00:00
osmarks
ee23b81444 release version 2025-01-24 09:24:28 +00:00
osmarks
3852d0078d integrate rating model correctly 2025-01-23 13:45:59 +00:00
osmarks
fbf9b3421d fix lockfile 2025-01-18 19:35:21 +00:00
osmarks
3e568ff613 restore stashed code 2025-01-18 19:07:16 +00:00
osmarks
fcd28a5ede add missing files 2025-01-18 17:09:44 +00:00
osmarks
a5a6e960bb query code 2025-01-18 17:09:00 +00:00
osmarks
63caba2746 integrate rating model 2025-01-18 11:29:03 +00:00
osmarks
d3fcedda09 cleanup 2025-01-18 07:25:21 +00:00
osmarks
0a542ef579 repurpose meme rater 2025-01-18 07:19:08 +00:00
osmarks
163dceca4b enforce r, oops 2025-01-16 21:36:15 +00:00
osmarks
9291eacead remove rogue debug code 2025-01-16 21:28:01 +00:00
osmarks
f4376f62ed RobustVamana algorithm for big index run 2025-01-16 21:10:12 +00:00
osmarks
d341a8c243 accursed k-means balance thing 2025-01-16 16:02:59 +00:00
osmarks
5ab91aa17e CLI switch for L 2025-01-14 08:31:03 +00:00
osmarks
2cebce1b73 new PQ training code 2025-01-14 07:46:09 +00:00
osmarks
4dd97631df fix entire index algorithm (very silly bug) 2025-01-12 19:48:53 +00:00
osmarks
0a196694b1 minor tweaks 2025-01-11 12:19:10 +00:00
osmarks
44ffe80102 fix gitignore 2025-01-11 07:54:23 +00:00
osmarks
087419f470 remove vestigal r_cap 2025-01-11 07:36:46 +00:00
osmarks
8ce51bcb56 correct DiskANN algorithm (silly bug with greedy search) 2025-01-11 07:35:04 +00:00
osmarks
e9ee563381 tweak some parameters 2025-01-03 09:22:39 +00:00
osmarks
4d08ec370b oops, dependencies 2025-01-02 21:04:50 +00:00
osmarks
265502f141 tweak index build, this had better work, aaa 2025-01-02 21:04:26 +00:00
osmarks
f1283137d6 release WIP DiskANN index orchestration code 2025-01-01 14:40:24 +00:00
osmarks
35df1201e2 fix overflow bug 2025-01-01 14:40:00 +00:00
osmarks
e0cf65204b release early draft of index code 2024-12-31 23:05:48 +00:00
osmarks
512b776e10 use slightly worse image scaling 2024-11-13 18:31:18 +00:00
osmarks
8097ce8d91 improve dump processing and misc performance fixes 2024-11-11 19:43:07 +00:00
osmarks
c277b49dc1 fix resumption, oops 2024-11-07 20:43:26 +00:00
osmarks
b9bb629e6f performance improvements 2024-11-07 16:52:58 +00:00
osmarks
7fa14d45ae improve observability and fix up Reddit dump for full-scale run 2024-11-02 19:38:05 +00:00
1d0ff95955 Sparse autoencoder testing 2024-10-05 17:22:44 +01:00
fc6d0c9409 Fix crawler (rate limit changes) 2024-07-15 18:48:55 +01:00
43ff9215fb File metadata storage.
The backend now knows how big images are, so the frontend can size images correctly before they're loaded.
This should significantly improve the UI on slow connections.
Also fix bug where videos weren't erased from the index properly.
2024-06-26 20:02:12 +01:00