Re-working Fast.AI lesson 4, I transferred the approach from Jeremy’s notebook “Getting started with NLP for absolute beginners” to the Kaggle competition “Natural Language Processing with Disaster Tweets”.
When I started this project, I did not expect it to become such an extended endeavor. It introduced me to many different aspects of natural language processing in particular and machine learning in general. To share what I learned with the community, I wrote up my approach and the key takeaways in this blog post.
In the spirit of producing results quickly and training models early in the development process:
The key learnings:
More details are in my blog post.
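To illustrate the "results quickly, models early" spirit, here is a minimal sketch of a first baseline for the disaster-tweets task. This is an assumption for illustration, not the notebook's actual approach (which followed Jeremy's transformer-based workflow): a TF-IDF plus logistic-regression pipeline that gets a first score on the board in a few lines. The tiny in-line dataset is a stand-in; the real competition supplies a train.csv with "text" and "target" (1 = real disaster) columns.

```python
# Hedged sketch: a quick-and-dirty baseline, not the blog post's method.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Tiny stand-in data; replace with the competition's train.csv columns.
texts = [
    "Forest fire near La Ronge Sask. Canada",
    "Just finished a great workout at the gym",
    "Heavy flooding reported downtown, roads closed",
    "I love this new pizza place",
]
targets = [1, 0, 1, 0]  # 1 = real disaster tweet

# Fit bag-of-words features and a linear classifier in one pipeline.
baseline = make_pipeline(TfidfVectorizer(), LogisticRegression())
baseline.fit(texts, targets)

# Predict on an unseen tweet; output is a 0/1 class label.
pred = baseline.predict(["Earthquake damage reported in the city"])
print(pred[0])
```

A baseline like this also gives an early sanity check that the data loading and submission pipeline work end to end before investing in larger models.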
Just when I thought I was done with disaster tweets, I realized I had forgotten a topic I wanted to cover. In a new notebook version, I implemented a confusion matrix to find tweets that are incorrectly labeled in the training set - basically the same approach as looking at top losses (as in lesson 2, for example).
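The idea behind that approach can be sketched as follows. This is a hedged, library-agnostic version (the notebook itself would use fastai's `ClassificationInterpretation` and `plot_top_losses`); the function name `top_loss_indices` and the toy numbers are mine: rank training examples by their per-example loss, because the examples the model is most confidently wrong about are prime candidates for labeling errors.

```python
# Hedged sketch of "top losses" mislabel hunting, assuming binary labels
# and predicted probabilities for the positive class.
import numpy as np

def top_loss_indices(probs, labels, k=5):
    """Return indices of the k examples the model is most confidently
    wrong about - candidates for incorrect labels.

    probs  : predicted probability of the positive class, shape (n,)
    labels : ground-truth 0/1 labels, shape (n,)
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)
    eps = 1e-12  # avoid log(0)
    # per-example binary cross-entropy loss
    losses = -(labels * np.log(probs + eps)
               + (1 - labels) * np.log(1 - probs + eps))
    # highest-loss examples first
    return np.argsort(losses)[::-1][:k]

# Toy usage: example 2 has label 1 but near-zero predicted probability,
# so it surfaces first as a candidate for a labeling error.
probs = [0.9, 0.2, 0.05, 0.85]
labels = [1, 0, 1, 1]
print(top_loss_indices(probs, labels, k=2))  # prints [2 1]
```

Inspecting the off-diagonal cells of a confusion matrix serves the same purpose: each misclassified example there is either a model mistake or a labeling mistake, and reading the actual tweets tells you which.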
I was indeed successful in finding quite a few incorrectly labeled tweets, but surprisingly this did not improve my overall competition result - as far as I understand, this is a limitation of the dataset. I summarized the full story and my learnings in this blog post.
Summing it all up, I also wrote this Kaggle discussion forum post.