Infoshare: Alejandro Saucedo - Real time NLP & machine learning with Spark Streaming, Kafka and Spacy
At Infoshare 2019, Alejandro Saucedo gave a live presentation on real-time NLP and machine learning with Spark Streaming, Kafka and spaCy.
The need for real-time machine learning use cases in production is increasing. This talk provides practical insight into how to build production-ready, real-time data streaming pipelines for machine learning. We will cover a case study that performs automated content moderation on Reddit comments in real time. We will dive into fundamental concepts of stream processing such as windows, watermarking and checkpointing, and we will show how to use frameworks like Kafka, spaCy and Spark Streaming.
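To make the windowing and watermarking concepts mentioned above more concrete, here is a minimal sketch in plain Python. It is not taken from the talk and does not use Spark; the function name `tumbling_window_counts`, the `allowed_lateness` parameter and the "spam"/"ok" labels are hypothetical and chosen only for illustration. It assumes integer event timestamps and a simplified rule in which an event is dropped once its window has closed behind the watermark; real stream processors apply more nuanced rules.

```python
from collections import defaultdict


def tumbling_window_counts(events, window_size, allowed_lateness):
    """Count events per (window_start, key) over fixed, non-overlapping windows.

    events: iterable of (event_time, key) pairs, in arrival order.
    The watermark is the largest event time seen so far minus allowed_lateness.
    An event is dropped if its window ended at or before the watermark.
    This is a simplified illustration, not Spark's exact semantics.
    """
    counts = defaultdict(int)
    dropped = []
    max_event_time = float("-inf")

    for event_time, key in events:
        # Advance the watermark based on the newest event time observed.
        max_event_time = max(max_event_time, event_time)
        watermark = max_event_time - allowed_lateness

        # Assign the event to its tumbling window [window_start, window_end).
        window_start = (event_time // window_size) * window_size
        window_end = window_start + window_size

        if window_end <= watermark:
            # The window is already closed: treat the event as too late.
            dropped.append((event_time, key))
            continue

        counts[(window_start, key)] += 1

    return dict(counts), dropped


# Hypothetical moderation labels arriving out of order.
events = [(1, "spam"), (4, "ok"), (12, "spam"), (3, "spam"), (27, "ok"), (8, "spam")]
counts, dropped = tumbling_window_counts(events, window_size=10, allowed_lateness=5)
print(counts)
print(dropped)
```

In this run the event at time 3 arrives late but is still counted, because the watermark (12 - 5 = 7) has not yet passed the end of its window. The event at time 8 arrives after the watermark has advanced to 22, so it is dropped.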
The talk took place on 8th May 2019 on the DataTech Stage during Infoshare 2019.