B.Sc. Statistics & Data Scientist

DECODING DATA DYNAMICS.

Hi, I'm Bhaskar Jha. I bridge the gap between rigorous mathematical theory and modern data engineering, building intelligent tools with Python, SQL, and deep learning. I'm an open source contributor at Open Food Facts.

Bhaskar Jha
MACHINE LEARNING · DATA ENGINEERING · STATISTICS · PYTHON · DEEP LEARNING
Data Tools Built
B.Sc. Stats Cohort
GitHub Contributions
GSoC Participant 2025

The Journey

2025 - PRESENT

Google Summer of Code

Contributor to Open Food Facts. Navigating large-scale legacy codebases (Perl, HTML) and fixing critical UI persistence bugs. Connecting statistical training to global production systems.

2024 - PRESENT

Independent Project Builder

Engineering reactive business dashboards, spaced repetition systems, and local EDA automation tools. Deployed multiple high-polish web applications using Python and JS.

ONGOING

B.Sc. Statistics

Deep specialization in Probability, Inference, and Sampling Distributions. Applying theory to practical data science projects that detect and explain real-world anomalies.

Selected Work

DataSense EDA

Python · NLP · Automation

Offline CSV EDA tool with NLP insights, Pearson correlation engine, and predictive ML models for rapid data analysis. Built for efficiency and depth in local environments.
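
To make the Pearson correlation step concrete, here is a minimal standard-library sketch of how pairwise correlations between numeric columns could be computed. It is an illustration only and not DataSense's actual code: the function names `pearson` and `correlation_matrix` and the dict-of-lists input format are assumptions made for this example.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    # Sum of cross-deviations and sums of squared deviations
    cov = sum(a * b for a, b in zip(dx, dy))
    var_x = sum(a * a for a in dx)
    var_y = sum(b * b for b in dy)
    if var_x == 0 or var_y == 0:
        # A constant column has no variance, so r is undefined
        return float("nan")
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns):
    """columns: dict mapping column name -> list of numbers (hypothetical format).

    Returns a nested dict with the pairwise Pearson r for every column pair.
    """
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}
```

A perfectly linear pair such as `[1, 2, 3]` and `[2, 4, 6]` gives an r of 1 (up to floating-point rounding), while a constant column yields `nan` because its variance is zero.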

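import pandas as pd
from datasense import AutoExplorer

# Load the raw CSV from local disk
df = pd.read_csv('raw_data.csv')
# Wrap the DataFrame in the DataSense explorer
engine = AutoExplorer(df)
# Run the correlation analysis
engine.analyze_correlations()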

[Image: DataSense Interactive User Interface]

[Image: NexusSales Revenue Intelligence Dashboard]

NexusSales

Data Pipeline · CRM Analysis · Visualization

Full data pipeline transformation. Ingesting messy CRM data and outputting a reactive business dashboard with actionable statistical insights for sales growth.
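
As a rough illustration of the cleaning stage in such a pipeline, the sketch below normalizes messy CRM rows, drops duplicates, and totals revenue per sales rep. It is not the NexusSales code: the field names `deal_id`, `rep`, and `amount`, and both function names, are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def clean_record(raw):
    """Normalize one raw CRM row; return None if the amount cannot be parsed."""
    amount_text = str(raw.get("amount", "")).replace("$", "").replace(",", "").strip()
    try:
        amount = float(amount_text)
    except ValueError:
        return None
    return {
        "deal_id": str(raw.get("deal_id", "")).strip(),
        # Collapse repeated whitespace and unify capitalization of names
        "rep": " ".join(str(raw.get("rep", "")).split()).title(),
        "amount": amount,
    }


def revenue_by_rep(raw_rows):
    """Clean rows, keep the first row per deal_id, and total revenue per rep."""
    seen = set()
    totals = defaultdict(float)
    for raw in raw_rows:
        record = clean_record(raw)
        if record is None or record["deal_id"] in seen:
            continue
        seen.add(record["deal_id"])
        totals[record["rep"]] += record["amount"]
    return dict(totals)
```

The resulting per-rep totals are the kind of aggregate a dashboard layer could then chart. Keeping the first occurrence of each `deal_id` is one simple deduplication policy among several; a real pipeline might prefer the most recent row instead.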

MNEMO System

Gamification · Algorithms · Python

Ebbinghaus-based spaced repetition tool. Features an XP-based leveling system and custom scheduling algorithms to optimize information retention for complex subjects.
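
The idea behind Ebbinghaus-based scheduling can be sketched as follows, assuming the common exponential form of the forgetting curve, R = exp(-t / S), where t is the number of days since the last review and S is a per-item memory stability. This is an illustrative model, not MNEMO's actual algorithm: the 0.9 target retention, the doubling of stability on success, and the 100 XP per level are all assumed values.

```python
import math


def retention(days_elapsed, stability):
    """Predicted recall probability under an exponential forgetting curve."""
    return math.exp(-days_elapsed / stability)


def next_interval(stability, target_retention=0.9):
    """Days until predicted retention decays to the target (assumed 0.9)."""
    return -stability * math.log(target_retention)


def review(stability, recalled, growth=2.0):
    """Update stability after a review.

    Assumed rule: multiply stability on a successful recall,
    reset it to one day on a failure.
    """
    return stability * growth if recalled else 1.0


def level_for_xp(xp, xp_per_level=100):
    """Map accumulated XP to a level, assuming a flat XP cost per level."""
    return 1 + xp // xp_per_level
```

Scheduling each review at `next_interval(stability)` days means an item resurfaces just as its predicted recall falls to the target, so well-known items are seen less often as their stability grows.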

[Image: MNEMO Algorithm Dashboard]