GenAI Productionize 2.0: The premier conference for GenAI application development
In this article, Galileo founding engineer Nikita Demir discusses common data errors that NLP teams run into, and how Galileo helps fix these errors in minutes, with a few lines of code.
We used Galileo on the popular MIT dataset with a NER task, to find data errors fast, fix them, get meaningful gains within minutes, and made the fixed dataset available for use.
In this post, we discuss the Named Entity Recognition (NER) task, why it is an important component of various NLP pipelines, and why it is particularly challenging to improve NER models.
We used Galileo on the popular Newsgroups dataset to find data errors fast, fix them, get meaningful gains within minutes, and made the fixed dataset available publicly for use.
Machine Learning is advancing quickly but what is changing? Learn what the state of ML is today, what being data-centric means, and what the future of ML is turning into.
When working on machine learning (ML) projects, the challenges are usually centered around datasets in both the pre-training and post-training phases, with their own respective issues that need to be addressed. Learn about different ML data blind spots and how to address these issues.
Working with Natural Language Processing?
Read about Galileo’s NLP Studio