Carlos Santos's Library tagged → View Popular
New tutorial - Building a search engine with Appengine and Yahoo — The Uswaretech Blog - Django Web Development
new tutorial on building a search engine using Appengine, and Yahoo Search API here. This uses pure Appengine API, and not Django, and is a tutorial on how to use Appengine without Django.
Announcing TweetMotif for summarizing twitter topics with a dash of NLP - Brendan O'Connor's Blog
some discussion of internals
Yahoo Placemaker: Extract Location Data from Any Text
yahoo free service to add geo information to data
-
In addition, Yahoo also announced that, starting today, it will allow developers to download and use the full data set of Yahoo's GeoPlanet. The GeoPlanet data, which contains information about millions of placenames in multiple languages, also forms the basis of Placemaker's geographical knowledge.The GeoPlanet data will be licensed under the Creative Commons Attribution license.
Clustering Billions of Images with Large Scale Nearest Neighbor Search
Image collections on this scale make performing even the most common and simple computer vision, image processing, and machine learning tasks non-trivial. An example is nearest neighbor search, which not only serves as a fundamental subproblem in many more sophisticated algorithms, but also has direct applications, such as image retrieval and image clustering. In this paper, we address the nearest neighbor problem as the first step towards scalable image processing. We describe a scalable version of an approximate nearest neighbor search algorithm and discuss how it can be used to find near duplicates among over a billion images.
-
Image collections on this scale make performing even the most common and simple computer vision, image processing, and machine learning tasks non-trivial. An example is nearest neighbor search, which not only serves as a fundamental subproblem in many more sophisticated algorithms, but also has direct applications, such as image retrieval and image clustering. In this paper, we address the nearest neighbor problem as the first step towards scalable image processing. We describe a scalable version of an approximate nearest neighbor search algorithm and discuss how it can be used to find near duplicates among over a billion images.
Whoosh
Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.
-
Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.
Selected Tags
Related Tags
Sponsored Links
Top Contributors
Groups interested in search
-
Web Search
Old and new web search engines
Items: 19 | Visits: 155
Created by: Joel Bennett
-
IBP_overview
iBusinessPromoter (IBP) is ...
Items: 13 | Visits: 310
Created by: nmstrategies _
Highlighter, Sticky notes, Tagging, Groups and Network: integrated suite dramatically boosting research productivity. Learn more »
Join Diigo
