Chen Li is a professor in the Department of Computer Science at UC Irvine. He received his Ph.D. degree in Computer Science from Stanford University, and his M.S. and B.S. in Computer Science from Tsinghua University, China, respectively. His research interests are in the field of data management, including data-intensive computing, query processing and optimization, visualization, and text analytics. His current focus is building open source systems for data management and analytics. He was a recipient of an NSF CAREER Award, several test-of-time publication awards, and many grants and industry gifts. He was once a part-time Visiting Research Scientist at Google. He founded a company to commercialize university research.
Talk InformationBig Data as a Service for Data Scientists11:15 to 12:15 April 13WEB 3780
Data scientists and domain experts often face challenges when dealing with big data, especially due to the large scale and limited IT knowledge and infrastructure maintenance skills. In this talk, I will present several software solutions we are developing to support Big Data as a service to these users. These solutions include Apache AsterixDB as an open source parallel database, Cloudberry as a middleware system to support data visualization, and Texera as a system to enable browser-based text analytics using declarative workflows. These solutions can be integrated to support ingestion, storage, indexing, querying, visualization, and analytics of big data. As an example, we will report experiences of using these solutions to support management of large-scale social media data (e.g., billions of tweets in terabytes) as a service to researchers of various disciplines such as social science and public health from several schools and universities.