Skip to content
View bbucalonserra's full-sized avatar
πŸ˜„
πŸ˜„

Block or report bbucalonserra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bbucalonserra/README.md

Hi there, I'm Bruno Serra πŸ˜ƒ

Holding a Bachelor's Degree in Engineering and a Master's Degree in Data Science and Analytics, I'm completely passionate about data science and analytics, with a particular focus on leveraging machine learning and data analytics to extract insights for business growth. My experience centers on multinational corporations with millions of employees, where I've tackled massive datasets comprising billions of rows. Currently, I'm employed at AB InBev, the world's largest brewer based in Belgium, overseeing iconic global brands such as Budweiser, Corona, and Stella Artois. In my role, I specialize in designing and deploying analytical and statistical models to unearth valuable insights, empowering informed decision-making across the organization.

Programming languages


html15 html15 html15 html15 html15 html15 html15 html15 html15 html15

Technologies and applications:


html15 html15 html15 html15 html15 html15 html15 html15 html15

Main Projects

Data Science

  1. Delayed Deliveries Prediction

    Algorithm: Classification - Decision Tree

    One of the major problems faced by delivery companies is DELAYED DELIVERIES. In light of this, the idea emerged to create a predictive analysis using machine learning models to anticipate whether a delivery will be delayed or not based on the data collected from orders. There are no solid premises regarding the reasons for delays, only that, given that the datasets used are in a snowflake schema, a filter was applied to analyze only motorcyclists, always in the food segment, with delivery status always marked as delivered and order status as finalized.

  2. Order Days Prediction

    Algorithm: Regression - XGBoost Regressor

    The objective of this predictive model is to estimate the number of days each user is expected to place orders in the upcoming month. By forecasting user behavior at this level of granularity, the logistics department can better anticipate demand patterns, optimize resource allocation, and streamline operational efficiency. This model serves as a critical tool for enhancing the accuracy of demand planning, ensuring timely inventory management, and reducing potential bottlenecks in the supply chain. Through advanced predictive analytics, we aim to provide actionable insights to support decision-making and drive a more proactive logistics strategy.

  3. Demand Prediction

    Algorithm: Regression - ARIMA

    The primary objective of this analysis is to predict demand for the upcoming weeks using advanced time series forecasting techniques. By leveraging historical data and identifying trends, seasonality, and patterns, the model aims to provide accurate demand projections. These insights are crucial for optimizing inventory management, improving supply chain efficiency, and ensuring that resources are allocated effectively to meet future requirements. This predictive approach empowers businesses to adopt a proactive strategy, minimize waste, and enhance customer satisfaction by staying ahead of demand fluctuations.

Advanced Analytics

  1. Woman Violence Analysis

    Method: Exploratory Data Analysis

    The incidence of violence against women in the state of Minas Gerais represents a significant social concern. The lack of detailed analysis of available data hinders the understanding of patterns, critical areas, and underlying factors of this phenomenon, thereby impeding the implementation of targeted and effective strategies to combat this issue. In light of this, the idea emerged to create an exploratory data analysis using data from the State of Minas Gerais from Brazil.

  2. Body Fat Prediction

    Method: Exploratory Data Analysis focused on Linear Regression Inference

    Body fat prediction using machine learning (ML) aims to estimate body fat percentage based on individual characteristics. This process involves data collection, deeply analyzing the linear regression, and evaluating its performance to predict body fat, supporting health diagnostics and monitoring.

ML Ops and Cloud

  1. Azure Data Lake - Medallion Architecture

    Method: Medallion Architecture in Data Lake using Azure

    Education is a crucial aspect of human and social development, playing a fundamental role in creating more just and egalitarian societies. Brazil, with its diversified cultural composition and numerous ethnic groups, including indigenous communities that contribute significantly to the nation's identity, requires special consideration for indigenous education. It is important to prioritize the preservation of cultural traditions and the empowerment of native communities. Due to this, it will be built a data lake environment using Azure with the medallion architecture, performing ETL and the final analysis.

  2. Machine Learning Operations

    Method: ML Flow and Unit Tests

    This repository demonstrates how to effectively use MLflow to manage and track machine learning experiments while exploring the full lifecycle of a data science project. It also highlights the use of unit tests to ensure code quality and reliability throughout the process.

Pinned Loading

  1. woman-violence-analysis woman-violence-analysis Public

    This is a MASTERS DEGREE PROJECT in Data Science and Analytics, discipline Data Analysis and Best Practices. The project is an exploratory data analysis of woman violance in a state of Brazil.

    Jupyter Notebook

  2. azure-data-lake azure-data-lake Public

    This is a MASTERS DEGREE PROJECT in Data Science and Analytics, discipline Data Engineering. The project is development of a data lake house using medallion architecture in Azure.

    Jupyter Notebook 1 1

  3. delivery-delay-prediction delivery-delay-prediction Public

    This is a MASTERS DEGREE PROJECT in Data Science and Analytics, discipline Machine Learning and Analytics. The project is a machine learning supervised problem of classification regarding delayed d…

    Jupyter Notebook

  4. demand-forecasting-and-prediction demand-forecasting-and-prediction Public

    This projecs aims to predict the demand by using time series analysis.

    Jupyter Notebook

  5. mlops-cycle mlops-cycle Public

    The project aims to create an environment in ML Flow using Anaconda and applying unitary tests to a machine learning model, prepating it to production.

    Python 1

  6. order-days-prediction order-days-prediction Public

    This projecs aims to predict the order days within the next month to support the logistics department. It's a regression problem.

    Jupyter Notebook 1