fabiosv
Fabio Valonga
Sao Paulo, Brazil

I am a versatile data professional with a background spanning data engineering, NLP, and web development. With a strong track record of creating and optimizing data pipelines, leading teams, and improving code quality, I thrive on complex challenges. My experience includes working with diverse tech stacks and cloud platforms, ensuring efficient data processing and analytics. I am passionate about delivering data-driven insights and fostering a culture of excellence in every role I take on.

CodersRank Score

This represents current experience, calculated by analyzing the connected repositories. By measuring skills through code, CodersRank builds a ranking that shows how a developer compares to others and what to improve.

928.1
CodersRank Rank
Top 1%
Ruby: Associate Developer
JavaScript: Mid Developer
Python: Mid Developer
Highest experience points: 0 points; 0 activities in the last year

Work Experience
Hvar Consulting Services
Jun 2023 - Apr 2024 (10 months)
Remote
Tech Lead - Data Engineer
Summary:
• Migrating data from the SAP Data Warehouse to Azure Storage Gen2 and the Databricks Lakehouse (Unity Catalog), enabling seamless integration with Power BI reports
• Providing leadership to the team and ensuring the successful execution of projects
• Ingesting SAP tables through Azure Data Factory pipelines and making them available in a star-schema architecture
• Supporting streaming data pipelines utilizing Spark Structured Streaming, Change Data Feed, and Delta Live Tables (see the sketch after this list)
• Exploring view materialization with Spark Streaming for incremental joins
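For illustration, below is a minimal PySpark sketch of that streaming pattern: reading a Delta table's Change Data Feed with Spark Structured Streaming and merging the changes into a downstream table. It is a sketch under assumed names; the source and target tables, business key, and checkpoint path are hypothetical, not the project's actual code.

```python
# Minimal sketch, assuming hypothetical table names and keys; not the actual
# project code. Streams a Delta table's Change Data Feed and merges changes
# into a downstream table (deletes omitted for brevity).
from delta.tables import DeltaTable
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.window import Window

spark = SparkSession.builder.getOrCreate()

# Stream row-level changes captured by Change Data Feed on the source table
changes = (
    spark.readStream
    .format("delta")
    .option("readChangeFeed", "true")
    .table("bronze.sap_orders")  # hypothetical source table
)

def upsert_batch(batch_df, batch_id):
    # Keep only the latest change per business key within the micro-batch,
    # then drop the CDF metadata columns before merging
    w = Window.partitionBy("order_id").orderBy(F.col("_commit_version").desc())
    latest = (
        batch_df
        .filter(F.col("_change_type") != "update_preimage")
        .withColumn("rn", F.row_number().over(w))
        .filter("rn = 1")
        .drop("rn", "_change_type", "_commit_version", "_commit_timestamp")
    )
    target = DeltaTable.forName(spark, "gold.dim_orders")  # hypothetical target
    (
        target.alias("t")
        .merge(latest.alias("s"), "t.order_id = s.order_id")
        .whenMatchedUpdateAll()
        .whenNotMatchedInsertAll()
        .execute()
    )

(
    changes.writeStream
    .foreachBatch(upsert_batch)
    .option("checkpointLocation", "/mnt/checkpoints/dim_orders")
    .start()
)
```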


Day-to-day responsibilities:
• Designing data pipeline architectures to ensure optimal performance
• Assisting the team in overcoming technical challenges and roadblocks
• Mentoring and guiding squad members in their career development
• Defining and promoting code patterns for reusable and maintainable code for Databricks and Data Factory

Improvements/Accomplishments:
• Created a generic framework that supports all data wrangling
• Reduced compute costs by optimizing cluster usage/resources and applying Spark Structured Streaming
• Reduced cloud costs by migrating Data Factory parallelism to Databricks Jobs running Spark Structured Streaming pipelines

Technology Stack:
• Azure Data Factory, Azure Functions, Azure Storage Gen2
• Azure Databricks
Databricks, Azure, PySpark, Unity Catalog
Softensity
Apr 2022 - Jun 2023 (1 year 2 months)
Remote
Data Engineer
Summary:
• Migrating data from RDS to S3 using EMR and Databricks, which involved Spark and Hadoop
• Establishing a data-serving mechanism from S3 through a REST API developed with Node.js and Express
• Creating a robust data pipeline leveraging Delta Lake libraries and Kafka (see the sketch below)
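As a rough illustration of that Kafka pipeline (the production version was written in Scala, as noted under the responsibilities below), here is a PySpark sketch of reading a Kafka topic, deduplicating events, and writing a Delta table to S3. The topic, schema, brokers, and paths are all hypothetical.

```python
# Minimal PySpark sketch, assuming a hypothetical topic, schema, and S3 paths;
# the production pipeline was written in Scala.
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StringType, TimestampType

spark = SparkSession.builder.getOrCreate()

schema = (
    StructType()
    .add("event_id", StringType())
    .add("payload", StringType())
    .add("event_time", TimestampType())
)

events = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")  # hypothetical brokers
    .option("subscribe", "orders")                     # hypothetical topic
    .load()
    .select(F.from_json(F.col("value").cast("string"), schema).alias("e"))
    .select("e.*")
)

# Deduplicate replayed events; including the event-time column lets the
# watermark bound the deduplication state
deduped = (
    events
    .withWatermark("event_time", "1 hour")
    .dropDuplicates(["event_id", "event_time"])
)

(
    deduped.writeStream
    .format("delta")
    .option("checkpointLocation", "s3://bucket/_checkpoints/orders")
    .start("s3://bucket/delta/orders")  # hypothetical target path
)
```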

Day-to-day responsibilities:
• Processed historical data, optimizing performance and efficiency by coalescing thousands of small files and database dumps into a Delta table using batch and stream processing, specifically Databricks Auto Loader (see the sketch after this list)
• Applied intricate business rules to the data, employing PySpark for data wrangling
• Supported an ongoing pipeline, adding new datasets with Scala, reading Kafka topics, applying deduplication, business rules, and saving them to S3
• Facilitated data access via a REST API developed with Node.js and Express
• Pioneered code patterns for reusability and maintainability
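A minimal sketch of the Auto Loader ingestion pattern referenced above, assuming hypothetical S3 paths, file format, and table name; not the project's actual code.

```python
# Minimal sketch, assuming hypothetical S3 paths and table names.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Auto Loader incrementally discovers new small files in the landing path
raw = (
    spark.readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "s3://bucket/_schemas/raw_events")
    .load("s3://bucket/raw/events/")
)

# Land all incoming files in a single Delta table instead of thousands of
# small files scattered across the bucket
(
    raw.writeStream
    .format("delta")
    .option("checkpointLocation", "s3://bucket/_checkpoints/raw_events")
    .trigger(availableNow=True)  # run as an incremental batch
    .toTable("bronze.raw_events")
)
```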

Improvements/Accomplishments:
• Creating a historical pipeline responsible for ingesting more than 1 petabyte of data, demonstrating my capability to handle large-scale data operations
• Developing reports for data quality, aiding in the identification of erroneous or missing data and rule changes
• Supporting the creation of monitoring tools to ensure the health of the data pipeline
• Reducing operational costs through cluster optimization and adhering to clean code practices

Technology Stack:
• Leveraging Java and Python for Blockchain and REST API collectors
• Employing Databricks pipelines powered by PySpark, Delta Lake, and Autoloader
• Crafting EMR pipelines using Spark and Scala
• Managing RDS with PostgreSQL databases
• Serving data through REST API on ECS with Node.js
• Facilitating data access via WebSockets using Python

This role not only showcased my proficiency in data engineering but also highlighted my problem-solving skills, adaptability, and commitment to maintaining data quality and pipeline efficiency.
Node.js, Databricks, AWS, Spark, Python, Ganglia, HDFS, Kafka, Big Data, QuickSight, data visualization, Scala, EMR
Hvar Consulting Services
Dec 2021 - Oct 2023 (1 year 10 months)
Remote
Tech Lead - Data Engineer
(Part-time job)

Summary:
• Migrating data from the SAP Data Warehouse to AWS Redshift or the Databricks Lakehouse, enabling seamless integration with Power BI reports
• Providing leadership to the team and ensuring the successful execution of projects
• Supporting streaming data pipelines utilizing Spark Structured Streaming, Change Data Feed, and Delta Live Tables (see the Delta Live Tables sketch after this list)
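As a rough illustration of the Delta Live Tables piece, here is a minimal Python pipeline definition. It is a sketch that would only run inside a Databricks DLT pipeline, and the table names, landing path, and data expectation are hypothetical.

```python
# Minimal Delta Live Tables sketch, assuming a hypothetical landing path and
# table names; runs only inside a Databricks DLT pipeline.
import dlt
from pyspark.sql import functions as F

@dlt.table(comment="SAP extracts landed as a streaming bronze table")
def bronze_sap_orders():
    return (
        spark.readStream  # `spark` is provided by the DLT runtime
        .format("cloudFiles")
        .option("cloudFiles.format", "parquet")
        .load("s3://bucket/landing/sap/orders/")  # hypothetical landing path
    )

@dlt.table(comment="Cleaned orders ready for Power BI consumption")
@dlt.expect_or_drop("valid_order_id", "order_id IS NOT NULL")
def silver_orders():
    return (
        dlt.read_stream("bronze_sap_orders")
        .withColumn("ingested_at", F.current_timestamp())
    )
```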


Day-to-day responsibilities:
• Designing data pipeline architectures to ensure optimal performance
• Assisting the team in overcoming technical challenges and roadblocks
• Mentoring and guiding squad members in their career development
• Defining and promoting code patterns for reusable and maintainable code

Improvements/Accomplishments:
• Created a generic framework that supports all data wrangling
• Refactored the framework to run in AWS Glue, AWS EMR, and Azure Databricks
• Reduced compute costs by optimizing cluster usage/resources and applying Spark Structured Streaming

Technology Stack:
• AWS Glue, Redshift, Athena, AWS S3
• Azure Databricks
PySpark, Databricks, Python, AWS, AWS Glue, Redshift, Azure, Data Factory, Amazon S3
