Part of Spark’s appeal is that it can process data in computer memory, as opposed to just using hard disks, which move at much slower speeds. But because the amount of data that can fit in memory is limited, the tool can process data on disks as well, and that’s what Databricks was trying to highlight as it sought to break Yahoo’s record on the Gray Sort, which measures the time it takes to sort 100 terabytes of data, aka 100,000 gigabytes. Yahoo did the sort in 72 minutes with a cluster of 2,100 machines using Hadoop MapReduce last year. Databricks was able to process the same amount of data in 23 minutes using Spark, using only 206 virtual machines running on Amazon’s cloud service. It also sorted a petabyte of data (about 1,000 terabytes) in less than four hours using 190 machines.
Creepy but cool interactive, “Creaturefy Yourself” from Animal Planet. Teens, especially boys, will love this. Just upload a photo and choose facial features from a variety of gruesome characters. I’m just thinking these could be used to create story characters, a Halloween bulletin board, or maybe a fun Halloween avatar.
I know nothing in the world that has as much power as a word. Sometimes I write one, and I look at it, until it begins to shine.
Social progressivism seems to be having its day in the US. Kerry talks to the world of the Climate Change threat. If only he could rebrand the issue as one of human rights, he might have some better luck getting national and international conversations to steer toward action.
My subconscious just lashes out while I’m sleeping. My dreams last night were so bizarre. Giant hornets. Bisexuality. Farscape. The Wizard of Oz. Church choirs. Edelweiss. Fractions.