Meet Josh Patterson, a 2024 BigDATAwire Individual to Watch


We dwell in a world of massive knowledge and large compute. However what about large question engines? One of many startups creating software program to maintain up with large knowledge and large compute is Voltron Information, which is headed by Josh Patterson.

Patterson co-founded Voltron Information in 2021 with pandas creator Wes McKinney (a 2018 Individual to Watch) to develop next-generation knowledge processing expertise for the Python knowledge ecosystem. About a 12 months in the past, Voltron Information firm launched Theseus, which it claims runs many occasions quicker than Spark whereas costing many occasions much less.

We lately caught up with Patterson, who’s the CEO of Voltron Information and in addition one in all our 2024 BigDATAwire Individuals to Watch, to speak about his work at Voltron Information and the Python knowledge ecosystem.

BigDATAwire: Voltron Information states that its Theseus product is for “petabyte-scale ETL.”  Why have we not been in a position to transfer past ETL in any case these years?

Josh Patterson: A single system can’t deal with all duties right this moment; particularly as analytics and ML change into extra advanced, there are specialised programs optimized for particular workloads. We see this within the rise of GPUs for AI. Given this continuous evolution and complexity, ETL evolves into an important service for managing these divergent programs, and it’s now the bottleneck.

When AI/ML coaching adopted {hardware} accelerators like GPUs, it improved AI system efficiency by 100,000x. Nonetheless, knowledge preprocessing remains to be on CPUs, and efficiency has solely grown 10X within the final decade. Organizations on the forefront of AI are constrained by knowledge processing as a result of they can not afford to construct out large knowledge CPU clusters quick sufficient. The efficiency divergence between GPUs and CPUs is getting exponentially worse. Solely Theseus, Voltron Information’s accelerator-native knowledge analytics engine, is attaining a 60x efficiency improve with 50x price financial savings leveraging the identical accelerators utilized in AI. Till we discover one singular method to attract intelligence from knowledge, we’ll at all times have ETL, which can regularly have to get quicker and extra environment friendly.

BDW: How did your expertise engaged on RAPIDS at Nvidia assist put together you for Voltron Information?

JP: My time at NVIDIA the place I launched RAPIDS (an open supply suite of information processing and ML libraries designed to allow knowledge science workflows on GPU) was like working at a large startup. It moved quicker than most enterprises, targeted on cutting-edge expertise, pioneered new use circumstances and tapped into beforehand non-existent industries. We had been relentlessly innovating.

With RAPIDS, we consistently considered methods to speed up adoption and maturity. Leveraging the open requirements ecosystem, akin to Apache Arrow, allowed us to speed up our growth and actually deal with innovation as an alternative of redoing issues that already existed – a philosophy that continues at Voltron Information right this moment.

BDW: What function do you see Voltron Information filling within the Python knowledge ecosystem within the years to return?

JP: With tasks like Ibis, pyArrow, and ADBC, we anticipate the open requirements we construct, promote, and preserve will underpin the Python knowledge ecosystem. As well as, requirements like Arrow and Substrait exist to help a mess of languages past the pythonic ecosystems.

Bridging these language divides so enterprises can scale out and combine their myriad of information ecosystems is central to Voltron Information’s mission to deliver a brand new approach to design and construct knowledge programs.

BDW: Outdoors of the skilled sphere, what are you able to share about your self that your colleagues is likely to be shocked to study – any distinctive hobbies or tales?

JP: Most individuals don’t know that I come from a protracted line of builders. Early in my profession, I used to be a licensed normal contractor and nonetheless get pleasure from constructing issues round the home or with my household.

To learn the remainder of the 2024 Individuals to Watch interviews, click on right here.

 

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles