Carlos Santos's Library tagged → View Popular
braindump: NOSQL debrief
The idea was to give attendees a solid introduction to how distributed, non relational databases work as well as an overview of the various projects out there.
biocep
With Biocep, R/Scilab computational engines are abstracted with URLs and can run at any location. They can be interactively controlled from the user's laptop either programatically or via an extensible, highly productive data analysis workbench or from highly programmable spreadsheets. The computational engines can be used as clusters on Grids and Clouds to solve computationally intensive problems, to build scalable analytical web applications or to expose functions as web services or nodes for workflow workbenches. They can also be used to distribute numerical/statistical user interfaces created with drag-and-drop tools and can be accessed simultaneously by several users to work with data collaboratively.
Tutorial: Scientific and parallel computing using IPython | Python for Scientific and Large Scale Computing
"This series introduces scientific and parallel computing using IPython with emphasis on IPython on a Windows PC. We discuss best practices for effectively using IPython with numpy, scipy, and matplotlib, as well has using IPython for interactive parallel computation." By J. Unpingco, who created the parallel ipython+vision (visual programming environmente) demo.
Cloudera Hadoop & Big Data Blog » Blog Archive » Building a distributed concurrent queue with Apache ZooKeeper
ZooKeeper is a system for coordinating distributed processes. In a distributed environment, getting processes to act in any kind of synchrony is an extremely hard problem. For example, simply having a set of processes wait until they’ve all reached the same point in their execution - a kind of distributed barrier - is surprisingly difficult to do correctly. ZooKeeper offers an API to facilitate this sort of distributed coordination.
Amazon Web Services Developer Community : Any AWS MapReduce examples using the R ...
discussion of R for big data processing
22S:295-HPC Home Page
High Performance Computing in Statistics: course notes; uses R running on a cluster
Plurk Open Source - LightCloud - Distributed and persistent key value database
Distributed and persistent key-value database
Amazon Elastic MapReduce
Using Hadoop on Amazon's "cloud"
-
Amazon Elastic MapReduce is a web service
-
t utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service (Amazon S3)
Selected Tags
Related Tags
Sponsored Links
Top Contributors
Groups interested in distribu...
-
Connectivism
Resources on connecting, di...
Items: 2 | Visits: 34
Created by: Frank in Mexico
-
Participatory Design
Resources in English or Fre...
Items: 8 | Visits: 30
Created by: doremi do
-
Teaching with Historical Documents
See the resource list belo...
Items: 85 | Visits: 64
Created by: Adele L
Highlighter, Sticky notes, Tagging, Groups and Network: integrated suite dramatically boosting research productivity. Learn more »
Join Diigo
