College4Dummies
IU
Big Red Cray, Big Red 2 Cray
and a couple of older monster computers
Purdue
Honors Dorm required for freshmen
Berkeley
AMPLab (AMP is short for algorithms, machines and people)
Apache Spark™ is a fast and general engine for large-scale data processing. Run programs 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Apache Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to easily be built and run effectively. Scales up to 10k nodes! Replicated master and slaves using ZooKeeper. Supports Docker containers.
Succinct (2015 in-memory db) blew away popular key-value stores MongoDB, Cassandra and HyperDex in terms of storage efficiency. Storing a 100-gigabyte dataset in memory with those systems required spreading the load across 16 machines with 64 gigabytes of RAM apiece. That’s because the secondary indices were 10 times the size of the original. Succinct fit 123 gigabytes of raw data onto a single 64-gigabyte machine! By 2016: “You actually [could] execute SQL queries directly on Succinct,” he explained. “… For the user, everything looks opaque. For him, it doesn’t matter whether there’s compression or not.”
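Quick sanity check on those Succinct numbers, as a rough sketch. It only uses the figures quoted above. The variable names are mine, and it treats the full RAM of each machine as "used", which is an assumption (the real systems may not have filled every machine).

```python
# Figures quoted in the notes above, all in gigabytes.
machines = 16
ram_per_machine_gb = 64
dataset_gb = 100
succinct_raw_gb = 123

# Total RAM provisioned across the 16-machine cluster.
cluster_ram_gb = machines * ram_per_machine_gb

# RAM provisioned per gigabyte of raw data for the other stores
# (assumes every machine's RAM counts, which is an upper bound).
overhead = cluster_ram_gb / dataset_gb

# RAM available per gigabyte of raw data on the single Succinct machine.
succinct_ratio = ram_per_machine_gb / succinct_raw_gb

print(cluster_ram_gb)            # 1024
print(round(overhead, 2))        # 10.24
print(round(succinct_ratio, 2))  # 0.52
```

So the cluster works out to roughly 10 GB of RAM per GB of data, which lines up with the "indices 10 times the size of the original" remark, while Succinct needed at most about half a GB of RAM per GB of raw data.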
MIT
OpenCourseWare. E/E