http://lucene.apache.org/hadoop/
Hadoop is a distributed computing platform / filesystem. It provides HDFS which is similar to GFS (Google Filesystem) and MapReduce which splits programming tasks into small chunks. It's target is 10'000 node clusters.