This link has been bookmarked by 99 people . It was first bookmarked on 19 Mar 2007, by jengalbells.
-
case blackopens access to the majority of it
-
-
What
we mean by 'Web 3.0' is that major web sites are going to be transformed into web
services - and will effectively expose their information to the world
-
-
Gianto WidiantoToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways ...
-
-
Today's Web has terabytes of information available to humans, but hidden from
computers. It is a paradox that information is stuck inside HTML pages,
formatted in esoteric ways that are difficult for machines to process. The so
called Web 3.0, which is likely to be a pre-cursor of the real semantic
web, is going to change this. What we mean by 'Web 3.0' is that major web sites
are going to be transformed into web services - and will effectively expose
their information to the world. -
The transformation will happen in one of two ways. Some web sites will follow
the example of Amazon, del.icio.us and Flickr and will offer their information
via a REST API. Others will try to keep their information proprietary, but it
will be opened via mashups created using services like Dapper,
Teqlo and Yahoo! Pipes - 8 more annotations...
-
-
Steve Hecom API's explained
-
-
Today's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways that are difficult for machines to process. The so called Web 3.0, which is likely to be a pre-cursor of the real semantic web, is going to change this. What we mean by 'Web 3.0' is that major web sites are going to be transformed into web services - and will effectively expose their information to the world.
-
-
-
The transformation will happen in one of two ways. Some web sites will follow the
example of Amazon, del.icio.us and Flickr and will offer their information via a REST
API. Others will try to keep their information proprietary, but it will be opened via
mashups created using services like Dapper, Teqlo and Yahoo! Pipes.
The net effect will be that unstructured information will give way to structured
information - paving the road to more intelligent computing. In this post we will
look at how this important transformation is taking place already and how it is likely to
evolve. -
One of the first web services
opened up by Amazon was the
E-Commerce service
. This service opens access to the majority of items in Amazon's
product catalog. The API is quite rich, allowing manipulation of users, wish lists and
shopping carts. However its essence is the ability to lookup Amazon's products. - 11 more annotations...
-
-
-
What
we mean by 'Web 3.0' is that major web sites are going to be transformed into web
services - and will effectively expose their information to the world. -
The net effect will be that unstructured information will give way to structured
information - paving the road to more intelligent computing. - 10 more annotations...
-
-
Patricia MossToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways ...
-
Adelbert Gthe entire system is turning into both a platform and the database. Yet, such transformations are never smooth. For one, scalability is a big issue. And of course legal aspects are never simple.
-
-
Today's Web has terabytes of information available to humans, but hidden
from computers. It is a paradox that information is stuck inside HTML pages, formatted in
esoteric ways that are difficult for machines to process. The so called Web 3.0, which is
likely to be a pre-cursor of the real semantic web, is going to change this. What
we mean by 'Web 3.0' is that major web sites are going to be transformed into web
services - and will effectively expose their information to the world. -
So how do these services get around the fact that there is no API? The answer is that
they leverage standardized URLs and a technique called Web scraping. Let's understand how
this works. In del.icio.us, for example, all URLs that have the tag book can be
found under the URL http://del.icio.us/tag/book; all URLs tagged with
the tag movie are at http://del.icio.us/tag/movie; and so on. The
structure of this URL is always the same: http://del.icio.us/tag[TAG].
So given any tag, a computer program can fetch the page that contains the list of sites
tagged with it. Once the page is fetched, the program can now perform the scraping - the
extraction of the necessary information from the page.How Web Scraping Works
Web Scraping is essentially reverse engineering of HTML pages. It can also be thought
of as parsing out chunks of information from a page. Web pages are coded in HTML, which
uses a tree-like structure to represent the information. The actual data is mingled with
layout and rendering information and is not readily available to a computer. Scrapers are
the programs that "know" how to get the data back from a given HTML page. They work by
learning the details of the particular markup and figuring out where the actual data is.
-
-
-
As more and more of the Web is becoming remixable, the entire system is turning into both a platform and the database. Yet, such transformations are never smooth. For one, scalability is a big issue. And of course legal aspects are never simple.
But it is not a question of if web sites become web services, but when and how. APIs are a more controlled, cleaner and altogether preferred way of becoming a web service. However, when APIs are not avaliable or sufficient, scraping is bound to continue and expand. As always, time will be best judge; but in the meanwhile we turn to you for feedback and stories about how your businesses are preparing for 'web 3.0'
-
-
Concepción Abraira Fernándezinteresante artículo
-
Andy BrudtkuhlThe so called Web 3.0, which is likely to be a pre-cursor of the real semantic web, is going to change this. What we mean by 'Web 3.0' is that major web sites are going to be transformed into web services - and will effectively expose their information to
-
jeunium jeuniumToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways ...
-
-
What
we mean by 'Web 3.0' is that major web sites are going to be transformed into web
services - and will effectively expose their information to the world.
-
-
Gary BurgeToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways that are difficult for machines to process. The so called Web 3.0, which is li
-
Webtwo Dozent
(tags: web2.0) -
Michel BauwensToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways that are difficult for machines to process. The so called Web 3.0, which is li
-
toplawstudentRSS Scraping and mixing services
-
timeuserToday's Web has terabytes of information available to humans, but hidden from computers. It is a paradox that information is stuck inside HTML pages, formatted in esoteric ways ...
-
-
Because most applications built
on top of this service drive traffic back to Amazon (each item returned by the service
contains the Amazon URL). -
it does
not open the del.icio.us database to the world. What it does do is allow
authorized mashups to manipulate the user information stored in del.icio.us.
-

Would you like to comment?
Join Diigo for a free account, or sign in if you are already a member.