
Infoshare: Alejandro Saucedo - Real time NLP & machine learning with Spark Streaming, Kafka and Spacy
During Infoshare 2019 Alejandro gave a live presentation of NLP & machine learning with Spark Streaming, Kafka and Spacy.
The need for real time machine learning use-cases in production is increasing. This talk will provide a practical insight on how to build real time data streaming machine learning pipelines that are production ready. We will be covering a case study performing automated content moderation on Reddit comments in real time. We will dive into fundamental concepts of stream processing such as windows, watermarking and checkponting, and we will show how to use frameworks like Kafka, Spacy and Spark Streaming.
The speech took place on 8th May 2019 on DataTech Stage during Infoshare 2019.
Hungry for more knowledge? Want to be informed when 2020 tickets will be available?
Inform me, please!
Tags:
See also:
LATEST NEWS
Infoshare 2025 | Shape Tomorrow Today 10.02.2025
Ticket sales have started! | Infoshare 2025 23.01.2025
Infoshare Startup Contest 2025 is... ON! 🔥 09.01.2025
Dziękujemy za sezon 2024! 31.12.2024
Warsztat Infoshare Katowice: Zamień observability w przewagę 20.11.2024
Zapisz się na Round Tables, Speed Dating i Matchmaking! 🗣 13.11.2024