"Overview produces intricate visualizations of large document sets — beautiful, but what do they mean? These visualizations are saying something about the documents, which you can interpret if you know a little about how they’re plotted.
There are two visualizations in the current prototype version of Overview, and both are based on document clustering."
Twapperkeeper, a Twitter archive analysis tool is now integrated with HootSuite - check it out if you're thinking of mining the Twittersphere
Research Software: for making changes in text files whereever certain patterns appear or extracting data from parts of certain lines while discarding the rest.
runs uploaded documents through OpenCalais, giving you access to extensive information about the people, places, and organizations mentioned in each
resources at the UO for meeting NSF grant data management requirements
Open Access Data Protocol: good solution for data management requirements imposed by NSF and other agencies
"Increasingly, scientific breakthroughs will be powered by advanced computing capabilities that help researchers manipulate and explore massive datasets.
The speed at which any given scientific discipline advances will depend on how well its researchers collaborate with one another, and with technologists, in areas of eScience such as databases, workflow management, visualization, and cloud computing technologies.
In The Fourth Paradigm: Data-Intensive Scientific Discovery, the collection of essays expands on the vision of pioneering computer scientist Jim Gray for a new, fourth paradigm of discovery based on data-intensive science and offers insights into how it can be fully realized."