Skip to content
View ArturoSbr's full-sized avatar
:shipit:
:shipit:
  • Google
  • Mexico City

Block or report ArturoSbr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ArturoSbr/README.md

Welcome!

My name is Arturo and I'm a highly caffeinated data scientist with 10 years of experience in the fields of tech and finance. I love training machine learning models from scratch and estimating causal effects from non-experimental data. In a nutshell, I use AI to figure out what will happen, and econometrics to understand why it's happening.

I work at @Google as a Behavioral Data Scientist and I also teach Econometrics at @ITAM.

What I'm working on

I'm currently obsessed with transformers and multi-head attention mechanisms.

I recently developed a vanilla PySpark implementation of the Louvain algorithm for community detection as well as a module that uses message passing to automatically engineer graph-based features.

I've also made a few contributions to NetworkX, scikit-learn and GraphFrames in recent months.

Hope you enjoy your stay (;

Popular repositories Loading

  1. youtube-dislike-count youtube-dislike-count Public

    Effect of removing the dislike count on the ratio of negative comments posted on political YouTube videos.

    Jupyter Notebook 4

  2. aws-ccp-cheat-sheet aws-ccp-cheat-sheet Public

    Cheatsheet to pass the AWS Certified Cloud Practitioner exam.

    4 1

  3. econometrics-ii-2025 econometrics-ii-2025 Public

    Econometrics II workshop (2025).

    Jupyter Notebook 4

  4. Information-values-in-Python Information-values-in-Python Public

    Custom Python function that calculates the Weight of Evidence and Information Value for a given feature. This bivariate method of feature selection is used to determine the relationship between a d…

    Jupyter Notebook 1

  5. Multivariate-feature-imputation-in-Python Multivariate-feature-imputation-in-Python Public

    Model an independent variable as a function of other independent variable to impute its missing values.

    Jupyter Notebook 1

  6. logistic-regression-for-null-value-imputation-in-Python logistic-regression-for-null-value-imputation-in-Python Public

    Custom Python function to impute missing values by fitting a logistic regression on the non-empty data and then impute missing values according to the observed event rate of the empty population.

    Python 1