2 results for tag: blocking
A geeky deep-dive: database deduplication to identify victims of human rights violations
In our work, we merge many databases to figure out how many people have been killed in violent conflict. Merging is a lot harder than you might think.
Many of the database records refer to the same people--the records are duplicated. We want to identify and link all the records that refer to the same victims so that each victim is counted only once, and so that we can use the structure of overlapping records to do multiple systems estimation.
Merging records that refer to the same person is called entity resolution, database deduplication, or record linkage. For definitive overviews of the field, see Scheuren, Herzog, and Winkler, Data Quality ...
Beka Steorts Named MIT Under-35 Innovator
We’ve known for years that Beka Steorts is on the cutting-edge of statistical science, and now The MIT Technology Review has realized the same. Last week she was named one of 35 Innovators Under 35, in the category of humanitarian.
We first became familiar with Beka's work in 2013 when she was a visiting professor at Carnegie Mellon and was introduced to us by Prof. Steve Fienberg. Since then, we’ve felt very fortunate to collaborate with her on projects such as the UN enumeration of casualties in the Syrian conflict, and we look forward to many more years of work with her. She is one of several young stars we include in our superheroine hall ...