Alain Antone's Library
In the world of web scraping, text mining, and article-reading utilities (such as the Readability bookmarklet), there is an ever-growing demand for tools that can distinguish the parts of an HTML document that make up an article from other common page building blocks such as menus, headers, footers, and ads.
The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page.
The library already provides specific strategies for common tasks (for example: news article extraction) and may also be easily extended for individual problem settings.
Extraction is very fast (milliseconds), needs only the input document (no global or site-level information), and is usually quite accurate.
Boilerpipe is a Java library written by Christian Kohlschütter. It is released under the Apache License 2.0.
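To make the idea of separating main content from clutter concrete, here is a minimal, self-contained Java sketch. It is not boilerpipe's API or its actual algorithm. It only illustrates the general approach of scoring text blocks by simple features, here word count and link density. The class name NaiveContentExtractor, the thresholds, and the regex-based block splitting are all assumptions made for illustration. A real extractor would use a proper HTML parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only: this is NOT boilerpipe's API or algorithm.
final class NaiveContentExtractor {

    // Assumed thresholds, picked for illustration rather than tuned on real pages.
    private static final int MIN_WORDS = 10;
    private static final double MAX_LINK_DENSITY = 0.33;

    // Script and style elements never contain article text.
    private static final Pattern SCRIPT_OR_STYLE =
            Pattern.compile("(?is)<(script|style)\\b.*?</\\1\\s*>");
    // Tags treated as boundaries between text blocks.
    private static final Pattern BLOCK_BOUNDARY = Pattern.compile(
            "(?i)</?(p|div|li|ul|ol|td|tr|table|h[1-6]|br|section|article|nav|header|footer)\\b[^>]*>");
    private static final Pattern ANCHOR =
            Pattern.compile("(?is)<a\\b[^>]*>(.*?)</a\\s*>");
    private static final Pattern ANY_TAG = Pattern.compile("(?s)<[^>]+>");

    private NaiveContentExtractor() {}

    /** Returns the text of blocks that look like article content, one block per line. */
    static String extract(String html) {
        String cleaned = SCRIPT_OR_STYLE.matcher(html).replaceAll(" ");
        List<String> kept = new ArrayList<>();
        for (String block : BLOCK_BOUNDARY.split(cleaned)) {
            // Count the words that sit inside links within this block.
            int linkWords = 0;
            Matcher anchors = ANCHOR.matcher(block);
            while (anchors.find()) {
                linkWords += countWords(stripTags(anchors.group(1)));
            }
            String text = stripTags(block);
            int words = countWords(text);
            // Keep blocks that are long enough and not dominated by link text.
            if (words >= MIN_WORDS && (double) linkWords / words <= MAX_LINK_DENSITY) {
                kept.add(text);
            }
        }
        return String.join("\n", kept);
    }

    // Removes remaining tags and collapses whitespace to single spaces.
    private static String stripTags(String fragment) {
        return ANY_TAG.matcher(fragment).replaceAll(" ").replaceAll("\\s+", " ").trim();
    }

    private static int countWords(String text) {
        return text.isEmpty() ? 0 : text.split(" ").length;
    }
}
```

Calling NaiveContentExtractor.extract(html) on a page string returns only the blocks that pass both thresholds. Navigation bars tend to fail the link-density check, and short footers tend to fail the word-count check. Like the library described above, the sketch needs nothing beyond the input document itself.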
Atlassian's software development tools help teams track issues and projects, find and review code, and improve testing and build management. Each tool integrates tightly with JIRA, Atlassian's issue tracker, providing a seamless experience for development teams. And IDE integrations put our tools where you work most.
Magnitude is an open-source framework for using an Android smartphone as an augmented reality platform. The framework is easy to use and plugin-oriented.
<script src="http://aza.googlecode.com/svn/trunk/SocialHistory/SocialHistory.js"></script>
<script>
  // Declared with var to avoid creating an implicit global.
  var user = SocialHistory();
  var visitsDigg = user.doesVisit("Digg");
  var visitsSlashdot = user.doesVisit("Slashdot");
  var listOfVisitedSites = user.visitedSites();
</script>
How do you recognise good programmers if you’re a business guy?
In his article The 18 mistakes that kill startups, Paul Graham makes the following point:
