Chris Jermaine is a professor of computer science at Rice University. He is the recipient of an Alfred P. Sloan Foundation Research Fellowship, a National Science Foundation CAREER award, and an ACM SIGMOD Best Paper Award. He is the general chair for SIGMOD 2018 in Houston. In his spare time, Chris enjoys outdoor activities such as hiking, climbing, and whitewater boating. In one particular exploit, Chris and his wife floated a whitewater raft (home-made from scratch using a sewing machine, glue, and plastic) over 100 miles down the Nizina River (and beyond) in Alaska.
Talk InformationLarge-Scale Data Processing with the SimSQL System2016/01/20 9:50 to 11:30amEvans Conference Room, WEB 3780
In this talk, I'll describe the SimSQL system, which is a platform for writing and executing data- and compute-intensive codes over large data sets, with a particular emphasis on very large-scale statistical computations.
At its heart, SimSQL is really a relational database system, and like other relational systems, SimSQL is designed to support data independence. That is, a single declarative code for a particular statistical inference problem can be used regardless of data set size, compute hardware, and physical data storage and distribution across machines. One concern is that a platform supporting data independence will not perform well. But we've done extensive experimentation, and have found that SimSQL performs as well as other competitive platforms that support writing and executing machine learning codes for large data sets.