Striim, A Next Generation Distributed Streaming Platform
Speaker: Alok Pareek
February 25, 2021
Moderated by Professor Kia Teymourian
Abstract: Striim is a novel end-to-end distributed streaming ETL and intelligence platform that enables rapid development and deployment of streaming applications. A key area of innovation is processing database events in a continuous streaming manner while accommodating schema drift and failures. The real-time ETL engine has been architected from the ground up to enable simple, distributed, stateful management of events with exactly-once processing (E1P) semantics. This talk will cover core architectural choices of the real-time ETL engine: (i) built-in adapters and caching structures to extract and load data in real time from legacy and new cloud endpoints; (ii) an extensible SQL-based transformation engine to transform events; (iii) a component called Open Processor, through which users can inject custom logic; (iv) new primitives such as MODIFY, BEFORE, and AFTER to manage database change records; and (v) built-in data validation that continuously checks that all data is reaching the destination. We will also cover real-world implementation challenges and open problems for future work.
Guest Speaker Bio: Alok Pareek is Founder and head of engineering at Striim, a Palo Alto-based software startup. Alok started his career as a developer on the Oracle kernel development team, where for ten years he contributed to core redo generation algorithms, point-in-time media recovery, and high-speed data movement algorithms. Subsequently, Alok served as CTO and software architect at GoldenGate Software, the industry leader in heterogeneous database replication. GoldenGate was acquired by Oracle in 2009. Post-acquisition, Alok led the product strategy for Oracle’s data integration software product portfolio, including GoldenGate. Alok also led the engineering and performance teams that collaborated with strategic customers on architecture and real-world implementations. He holds multiple patents in data management and has presented at numerous academic and industry conferences. Alok holds a graduate degree in Computer Science from Stanford University.