Harvard professor: DataPerf and AI’s need for data benchmarks

Snorkel AI

If we agree that, yes, we want to benchmark the data, the next question becomes: what exactly do we want to do? Fundamentally, there are three primary pillars in measuring data quality. First, how good is your training data? Second, how good is your test set data?
