Exploring a ‘Deep Web’ That Google Can’t Grasp
(by Frank van Harmelen)
From the New York Times this Monday:
One day last summer, Google
Beyond those trillion pages lies an even vaster Web of hidden data: financial information, shopping catalogs, flight schedules, medical research and all kinds of other material stored in databases that remain largely invisible to search engines.
The challenges that the major search engines face in penetrating this so-called Deep Web go a long way toward explaining why they still can’t provide satisfying answers to questions like “What’s the best fare from New York to London next Thursday?” The answers are readily available - if only the search engines knew how to find them.
(thanks to Mike Brodie for pointing to this one)
Related articles by Zemanta
- Google: “We’re Not Doing a Good Job with Structured Data” (readwriteweb.com)
Tags: Database, Deep Web, Google, Web search engine
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=21558eb4-dee9-4315-a599-5b1a981f72e4)