Real-time Analytics with Apache Druid at FullContact
Jeremy Plichta, Director of Engineering at FullContact; Janis Dancis, Sr. Software Engineer at FullContact; and Gian Merlino, co-founder and CTO of Imply talk all about Druid in their latest meet-up on September 17, 2019. Watch the replay below and see what the hype about Apache Druid is all about!
FullContact Talk Summary
FullContact is building one of the leading identity resolution as a service platforms to help brands and businesses connect to their customers on a more personal level. Doing this means keeping track of billions of different identity resolution events that occur through both API and Batch. When going to the whiteboard to build a system that could track all of this FullContact came up with a streaming pipeline architecture that flowed all usage into both S3 and Druid. This new system has offered immense flexibility to scale and give customers near real time insight into their API usage patterns. In this talk Janis and Jeremy will discuss what this pipeline looks like at a high level, some interesting problems they had to solve along the way and ideas on other Druid features they really should be leveraging to make the whole thing even better!
Imply Talk Summary
The dirty secret of most “streaming analytics” technologies is that they are just stream processors: they sit on a stream and continuously compute the results of a particular query. They’re good for alerting, keeping a dashboard up-to-date in real time, and streaming ETL, but they’re not good at powering apps that give you true insight into what is happening: for this, you need the ability to explore, slice/dice, drill down, and search into the data. This talk will cover the current state of the streaming analytics world and what Apache Druid, a real-time analytical database, brings to the table.
Meet the Speakers
Jeremy Plichta is a Director of Engineering at FullContact where he helps lead the DevOps, Foundations/Integrations, and Application Security Teams. He has worked with several technologies like Hadoop, Spark, and Kafka in the past and helped launch Apache Druid into the FullContact ecosystem to help solve API usage and aggregation problem. When he isn’t working he enjoys spending time with his wife and 3 kids, reading great sci-fi books, working out and snowboarding.
Janis Dancis is a Sr Software Engineer at FullContact where he is focused on building systems to capture and analyze our high volume API usage data, provide tooling to integrate this data into our accounting, invoicing and other back office systems, and developing front end applications that our customers use to interact with our platform. His favorite JVM technology is Clojure, which he is still trying to use to build a rocketship at FullContact. He gets his thrills outside of the office by racing rally cars up steep hills and taking tight corners.
Gian Merlino is a co-founder and the CTO of Imply, a San Francisco based technology company. Gian is also one of the main committers of Druid. Previously, Gian led the data ingestion team at Metamarkets and held senior engineering positions at Yahoo. He holds a BS in Computer Science from Caltech.
Photos from the Event:
Join the FullContact Team!
FullContact is the premier provider of SaaS-based identity resolution that empowers brands to improve their customer experience and authentically engage with consumers. Using a consumer-first approach with our product offerings, we aim to make relationships better and that starts with our employees.
Find out more about our open positions and apply today on our careers page.
October 20, 2022 Empowering Cohesive Customer Experiences Identity Resolution, Resolve, Customer Experience, Marketing & Sales, Enrich API
October 17, 2022 Frictionless Fraud Prevention Made Easy with FullContact’s Verify.Match Identity Resolution, Engineering, Privacy, Partnership, Customer Experience, Marketing & Sales
October 7, 2022 Inferred Identity Identity Resolution, Engineering