Natural Language Processing from Scratch
11:15am - 11:55am on Saturday, October 7 in PennTop NorthBruno Gonçalves, Noemi Derzsy
- Audience Level:
- Intermediate
- Slides:
- https://www.slideshare.net/NoemiDerzsy/pygotham-ny-2017-natural-language-processing-from-scratch
- Watch:
- https://youtu.be/lNPXlqtJwcQ
Overview
The advent of sophisticated online services, has resulted in an unprecedented generation of textual content making NLP a fundamental tool in any data scientists toolkit. Here we introduce the rudiments of Natural Language Processing, from counting words to topic modeling and language detection.
Description
We introduce the fundamental technique of natural language processing using Python and OpenNasa datasets. In particular:
- bag of words models
- stop words
- tf/idf
- basic topic modeling
- word clouds
- language detection
A GitHub repository will be made available with all the code and slides used during the talk.