Edit ‘osmarks.net_web_search_plan’

2025-05-29 18:29:29 +00:00
parent faa91d4ef7
commit 3318e96d98
1 changed files with 1 additions and 0 deletions
@@ -20,6 +20,7 @@ The job of a search engine is to retrieve useful information for users. This is
 * {Images, PDFs, etc contain useful knowledge which hasn't been integrated properly into most things. We need* these.
 * Common Crawl doesn't even get PDFs because they're complicated to process!
 * Obscure papers, product user manuals, shiny reports from organizations.
+* https://arxiv.org/abs/2407.01449
 }
 * So much tacit knowledge is in videos. Oh no. Maybe we can get away with an autotranscriber and frame extraction.