Data Egnineer, Open Source Software enthusiast, Apache Software Foundation committer.
I'm developing in Python, Scala/Java and some Rust. Mostly my activities are related to the Apache Spark / PySpark ecosystem and Data Engineering tools.
I'm a maintainer at the following projects:
- GraphFrames -- scalabale graph algorithms on top of Apache Spark DataFrames.
- Apache GraphAr (incubating) -- universal "open-table" format for storing Property Graphs.
- spark-fast-tests -- Apache Spark testing helpers and assertions (Scala).
- chispa -- Apache Spark testing helpers and assertions (Python).
- falsa -- CLI tool for generating datasets of the H2O benchmark. Wriiten in Rust.
And other various projects.
Wataktime weekly stats:
Scala 3 hrs 1 min █████████████▓░░░░░░░░░░░ 54.18 %
sbt 1 hr 5 mins █████░░░░░░░░░░░░░░░░░░░░ 19.69 %
Markdown 48 mins ███▓░░░░░░░░░░░░░░░░░░░░░ 14.51 %
TOML 14 mins █░░░░░░░░░░░░░░░░░░░░░░░░ 04.34 %
Python 14 mins █░░░░░░░░░░░░░░░░░░░░░░░░ 04.26 %
About any open source activities and / or collaborations you can reach me using [email protected].
About any other activities and / or collaborations you can reach me using my private email [email protected].