Roger Chen's Library
SPEAR Algorithm - Michael G. Noll
"Telling Experts from Spammers: Expertise Ranking in Folksonomies"
Technology Review: A Better Way to Rank Expertise Online
-
the fact is [that] quantity does not imply quality.
-
The new algorithm is called Spamming-resistant Expertise Analysis and Ranking (SPEAR) and is based on the well-known information-retrieval algorithm called HITS that is used by search engines like Google to rank Web pages.
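For readers unfamiliar with HITS, the mutual-reinforcement idea the annotation refers to can be sketched roughly as follows. This is a minimal sketch of plain HITS only, not of SPEAR itself; the function name `hits`, the toy edge list, and the user and document names are all hypothetical and chosen for illustration. Here users play the role of hubs and the documents they bookmark play the role of authorities.

```python
import math

def hits(edges, iterations=50):
    """Plain HITS by power iteration (illustrative sketch, not SPEAR).

    edges: list of (source, target) pairs, e.g. (user, document).
    Returns two dicts: hub scores and authority scores.
    """
    nodes = {n for edge in edges for n in edge}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: sum of the hub scores of nodes pointing at it.
        new_auth = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]
        norm = math.sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {n: v / norm for n, v in new_auth.items()}
        # Hub score: sum of the authority scores of nodes it points at.
        new_hub = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_hub[src] += auth[dst]
        norm = math.sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {n: v / norm for n, v in new_hub.items()}
    return hub, auth

# Hypothetical toy data: who bookmarked which document.
edges = [
    ("alice", "doc1"),
    ("alice", "doc2"),
    ("bob", "doc1"),
    ("spam", "doc3"),
]
hub, auth = hits(edges)
for user in sorted(("alice", "bob", "spam"), key=hub.get, reverse=True):
    print(user, round(hub[user], 3))
```

In this toy graph the isolated account that bookmarks only its own document ends up with a hub score near zero, which gives some intuition for why a HITS-style scheme can discount activity that is not reinforced by other users.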
Michael Nielsen » The Google Technology Stack
Part of what makes Google such an amazing engine of innovation is their internal technology stack: a set of powerful proprietary technologies that makes it easy for Google developers to generate and process enormous quantities of data. According to a senior Microsoft developer who moved to Google, Googlers work and think at a higher level of abstraction than do developers at many other companies, including Microsoft: “Google uses Bayesian filtering the way Microsoft uses the if statement” (Credit: Joel Spolsky). This series of posts describes some of the technologies that make this high level of abstraction possible.
Data Miners Blog: Data Mining and Statistics
-
The way I think about it, data mining is the process of using data to figure stuff out.
-
There is, however, a cultural difference between people who call themselves statisticians and people who call themselves data miners. This difference has its origins in different expectations about data size.
Introducing Apache Mahout
Once the exclusive domain of academics and corporations with large research budgets, intelligent applications that learn from data and user input are becoming more common. The need for machine-learning techniques like clustering, collaborative filtering, and categorization has never been greater, be it for finding commonalities among large groups of people or automatically tagging large volumes of Web content. The Apache Mahout project aims to make building intelligent applications easier and faster. Mahout co-founder Grant Ingersoll introduces the basic concepts of machine learning and then demonstrates how to use Mahout to cluster documents, make recommendations, and organize content.
Center for Data Insight - Data Mining Research Lab
The Center for Data Insight (CDI) is an applied research center partnered with the latest vendors of enterprise-ready products covering the entire spectrum of the Knowledge Discovery and Data Mining process. The CDI contains advanced parallel computers, servers, and workstations. The CDI also contains an extensive suite of industry-leading tools for database, data cleansing, data quality analysis, data preparation and aggregation, statistical analysis, data mining predictive modeling, and data visualization.