Surajit Chaudhuri
Visit Schedule


Surajit Chaudhuri is a Distinguished Scientist at Microsoft Research. He started the AutoAdmin project on self-tuning database systems. Surajit has also worked in the area of data cleaning. Their research on both physical database design and data cleaning has been incorporated in Microsoft products and services such as SQL Server and Bing. Surajit did his Ph.D. from Stanford University and he is an ACM Fellow. He was awarded the ACM SIGMOD Contributions award in 2004 , the 10 year VLDB Best paper Award in 2007, and ACM SIGMOD Edgar F. Codd Innovations Award in 2011.


Talk: Error-Tolerant Record Matching

Wednesday, November 9
WEB 101
Refreshments at 3:20, talk starts at 3:40

Record Matching is a key element of data cleaning technology. Error-Tolerant Record Matching reconciles multiple representations of the same entity in the presence of errors such as spelling mistakes and abbreviations. In this talk, we describe some of the key scenarios and the underlying technology for error-tolerant record matching that we have developed as part of our Data Cleaning project at Microsoft Research.

Tuesday, November 8
Time Activity Who Where
7:00pm - 9:00pm Pick up and Dinner Feifei, Suresh, Jeff Mazza at 9th and 9th

Wednesday, November 9
Time Activity Who Where
8:00 - 10:00 Breakfast By himself At the hotel
10:00 - 10:30 Pick up Feifei Li From the hotel
10:30 - 11:00 Meeting, faculty Feifei Li MEB 3464
10:30 - 11:00 Meeting, faculty Suresh Venkatasubramanian WEB 3404
11:30 - 12:05 Meeting, Suresh's students Suresh's graduate students Suresh's office or the theory room
12:05 - 1:00 Lunch Feifei, Suresh, Jeff TBD
1:00 - 1:30 Meeting, faculty Jeff M. Phillips MEB 3442
1:30 - 2:00 Meeting, Feifei's students Feifei's graduate students MEB 4158
2:00 - 2:30 Meeting, faculty Al Davis, Chair MEB 3190
2:30 - 3:00 Meeting, faculty Ganesh Gopalakrishnan MEB 3428
3:00 - 3:20 Break Feifei MEB 3464
3:20 - 3:40 Lecture prep (refreshments, etc.) WEB 101
3:40 - 5:00 Talk WEB 101
5:30 Quick Dinner Feifei TBD