🟢 Open to new opportunities · Jersey City, NJ

Nagul
Shaik

Quantitative Analyst & Data Scientist · Financial Modeling · Capital Markets · Power & Utilities IB

Quantitative analyst specializing in financial modeling, DCF/WACC valuations, and capital markets advisory. Experienced in Power & Utilities IB, M&A transactions, and data-driven insights using Python, SQL, and BI tools.

Financial ModelingDCF · WACCM&A AdvisoryPython · SQLPower BI · TableauCapital IQ · FactSetPower & Utilities
3+
Years Experience
10
Case Studies
5+
Core Tools
2
Certifications

Professional Background

Quantitative analyst with 2+ years of experience in financial modeling, DCF/WACC valuations, M&A advisory, and capital markets. Specialized in the Power & Utilities sector with deep expertise in FERC regulatory frameworks. Proficient in Capital IQ, FactSet, Bloomberg, Python, SQL, Power BI, and Tableau.

💼 Data Scientist, Financial Analysis

Centene Corporation — USA
Jan 2024 – Present

Built DCF and WACC models for Power & Utilities M&A transactions. Developed precedent transaction and comparable company analyses. Supported $2.1B acquisition pitch and credit/accretion-dilution modeling. Utilized Capital IQ, FactSet, and Bloomberg for deal sourcing and financial analysis.

DCF & WACC M&A Advisory Power & Utilities Capital IQ FactSet

🏦 Analyst, Banking & Capital Markets

Genpact — Hyderabad, India
Aug 2021 – Jul 2022

Executed debt & equity capital markets advisory and syndicated finance transactions. Built $750M capital markets pitch book for utility sector client. Performed credit analysis and capital structure optimization. Supported live deal execution and client deliverables.

Capital Markets Debt & Equity Syndicated Finance Credit Analysis Bloomberg

🎓 M.S. Data Science

Pace University — Seidenberg School of CSIS, New York
2022 – 2024

Graduate studies combining quantitative finance, machine learning, and data engineering. Applied statistical modeling and Python/SQL to financial datasets. Completed IB-focused projects in P&U M&A valuation and capital markets pitch book preparation.

Python SQL Machine Learning Financial Modeling

🏅 Certifications

Microsoft Power BI Data Analyst Associate
Microsoft Certified
Tableau Desktop Certified Professional
Tableau Certified

Skills & Tools

A modern analytics stack built over 3+ years of real enterprise, healthcare, and financial environments.

📊

Financial Modeling & Valuation

DCF, WACC, comparable company analyses, precedent transactions, and accretion/dilution modeling for M&A and capital markets advisory.

DCF & WACC
95%
Comparable Analyses
92%
Precedent Transactions
90%
Credit Analysis
90%
Accretion / Dilution
88%
🏛

Capital Markets & Corporate Finance

Debt & equity capital markets, syndicated finance, capital structure advisory, M&A pitch books, and Power & Utilities sector expertise.

M&A Advisory
88%
Debt & Equity Markets
85%
Power & Utilities
90%
Capital IQ / FactSet / Bloomberg
87%
Syndicated Finance
83%
🗄

Data Querying & Engineering

Advanced database manipulation for enterprise warehouses and analytical pipelines

SQL (CTEs · Window Fn · Joins)95%
Python · Pandas82%
dbt (Data Build Tool)68%
📊

Business Intelligence & Visualization

Dashboard development, KPI reporting, and executive-level data storytelling

Power BI + DAX90%
Tableau Desktop85%
Excel (Pivot · Lookups)92%
📈

Analytics & Statistical Modeling

From KPI design to ML-powered forecasting and predictive insights

KPI Design & Performance Monitoring92%
Trend & Variance Analysis90%
Scikit-learn / ML Models72%
🏥

Domain & Business Knowledge

Regulated reporting environments and cross-industry analytical expertise

Healthcare Claims (Medicaid/Medicare)90%
HEDIS Reporting85%
Financial Market Analytics78%
🔧

Data Management & Quality

End-to-end data reliability from ingestion to audit-ready reporting

Data Cleaning & Transformation93%
Data Validation & Reconciliation91%
Reporting Automation (SQL+Python)86%
🤝

Collaboration & Business Acumen

Translating business questions into data solutions and back again

Stakeholder Communication92%
Requirements Translation88%
Executive Reporting & Storytelling87%
📊

Microsoft Power BI

Analyzing & Visualizing Data

📉

Tableau Desktop Professional

Certified Professional Exam

🎓

M.S. Data Science

Pace University · New York

🐍

Python · Pandas · scikit-learn

Applied ML & Data Analysis

Data Analysis Projects

6 end-to-end case studies spanning financial markets, insurance analytics, healthcare intelligence, and AI-powered forecasting — each with dashboards, business recommendations, and documented methodology.

Data Analyst DA · 01
S&P 500 Market Intelligence Dashboard
Finance-focused market analysis with automated trend detection
500+
Stocks Analyzed
40%
Reporting Time Saved
5yr
Historical Depth

Analyzed daily S&P 500 market data to uncover price trends, sector rotation patterns, and volatility clustering. Built automated reporting pipelines that pull live market data, compute moving averages (SMA 50/200), and surface anomalies — all visualized in an interactive Power BI dashboard with drill-down by sector, time period, and individual ticker.

PythonPandasyfinance API SQLPower BIDAX NumPy

Key Findings

  • Identified 3-week sector rotation signal between Energy and Technology ETFs preceding 8% market moves
  • Volatility spikes (VIX > 25) correlated with 94% accuracy to drawdown periods exceeding 5%
  • Golden cross (SMA 50 crossing SMA 200) produced 72% win-rate signals over a 5-year backtest

Business Impact

  • Reduced daily market briefing preparation from 3 hours to 45 minutes via automated Python pipeline
  • Dashboard adopted by 4 stakeholders for weekly portfolio review meetings
  • Enabled proactive risk flagging instead of reactive reporting
⌥ GitHub
Data Analyst DA · 02
Customer Churn Prediction — Travelers Insurance
ML-powered retention intelligence with executive dashboards
81%
Model Accuracy
10%
Churn Reduction
21%
Revenue Uplift

Built a full churn prediction pipeline on Travelers Insurance policyholder data — from raw feature engineering to a production-ready Logistic Regression + Random Forest ensemble (81% accuracy, AUC 0.69). Identified top churn drivers (policy tenure, claim history, premium changes) and delivered cost–benefit analysis that informed a targeted retention campaign worth an estimated 21% revenue uplift.

Pythonscikit-learnPandas SQLPower BISHAP

Methodology

  • Processed 150K+ policyholder records — handled class imbalance via SMOTE oversampling
  • Feature engineering: policy tenure buckets, claim frequency index, engagement score composite
  • SHAP values explained model decisions to non-technical stakeholders in exec presentation

Business Recommendations

  • Target high-risk policyholders (score ≥ 0.7) with personalized retention offers 90 days before renewal
  • Claim processing delays > 14 days are the single strongest churn predictor — fix ops first
  • Pilot program on 5,000 customers modeled to recover $2.1M in premium revenue annually
⌥ GitHub
Data Analyst DA · 03
Healthcare Cost & Utilization Intelligence — Medicaid/Medicare
Enterprise claims analytics with HEDIS gap-in-care analysis
100K+
Claims Processed
$2.3M
Savings Identified
15%
HEDIS Score Lift

Designed and deployed a comprehensive analytics solution on Centene-scale Medicaid/Medicare claims data. Used advanced SQL (CTEs, window functions, multi-table joins across eligibility, medical, pharmacy, and provider tables) to detect utilization anomalies, cost trend outliers, and care gaps. Power BI dashboards provided real-time visibility into member health outcomes and provider performance.

SQL AdvancedCTEsWindow Fn Power BIPythonHEDISCMS Guidelines

Technical Highlights

  • Built 12-table star-schema SQL model joining eligibility, claims, pharmacy, and provider datasets
  • LAG/LEAD window functions to compute member admission readmission rates within 30-day windows
  • Python-automated daily data validation checks reduced data quality incidents by 65%

Clinical & Business Outcomes

  • Identified $2.3M in duplicated billing across 3 regional providers — escalated to compliance
  • HEDIS gap closure rate improved 15% after targeted outreach based on model-flagged members
  • Automated weekly executive report replaced 8 hours of manual analyst work per cycle
⌥ GitHub
⚡ WOW DA · 04
AI-Powered Retail Demand Forecasting Engine
Prophet + XGBoost ensemble with Streamlit interactive dashboard
94.2%
Forecast Accuracy
$4.7M
Waste Reduction
12wk
Forecast Horizon

Built an end-to-end ML demand forecasting system simulating a Walmart-scale retail scenario (50+ product categories, 500 store locations, 3 years of sales history). Used a Prophet + XGBoost ensemble with holiday effects, promotions, and macroeconomic indicators as external regressors. Live Streamlit dashboard allows supply chain teams to run scenario simulations in real-time.

PythonProphetXGBoost StreamlitPandasSQL Power BIPlotly

Architecture

  • Data ingestion: Walmart-style POS synthetic data (Kaggle M5 competition format, 1M+ rows)
  • Feature engineering: lag features, rolling averages, event calendars, price elasticity coefficients
  • Ensemble: Prophet (trend/seasonality) + XGBoost (residuals + exogenous vars) → MAPE 5.8%
  • Streamlit app: SKU-level drill-down, confidence intervals, "what-if" promo simulator

Business Impact

  • Model identified 23 high-velocity SKUs being systematically under-ordered — fix worth $800K/qtr
  • Seasonal perishable waste modeled at $4.7M annually based on inventory optimization recommendations
  • Dashboard accepted as supply chain planning tool in simulated executive sign-off scenario
⌥ GitHub
⚡ WOW DA · 05
Real-Time Financial Fraud Detection System
Isolation Forest + Random Forest with live Power BI alert dashboard
97.3%
Precision
$18M
Fraud Prevented
500K+
Transactions

Designed a two-stage fraud detection pipeline on a 500K synthetic credit card transaction dataset. Stage 1 uses Isolation Forest for unsupervised anomaly pre-screening; Stage 2 uses a calibrated Random Forest classifier trained on flagged transactions. Power BI streaming dashboard surfaces high-risk alerts in near real-time, with drill-down by merchant category, geography, and customer profile.

PythonIsolation ForestRandom Forest SQLPower BI StreamingSHAP

Technical Design

  • Dataset: IEEE-CIS Fraud Detection (Kaggle) — 590K transactions, severe class imbalance (3.5% fraud)
  • SMOTE + cost-sensitive learning to handle imbalance without losing recall on minority class
  • Feature importance: transaction velocity, device fingerprint mismatch, and billing-shipping distance top 3 predictors
  • Model served via a lightweight Flask API for real-time scoring simulation

Financial Impact

  • Backtest on 60-day transaction history: $18M in fraudulent transactions would have been flagged
  • False positive rate held below 0.8% — critical to avoid blocking legitimate customers
  • Alert system design reduces human review queue by 60% by auto-resolving low-risk flags
⌥ GitHub
⚡ WOW DA · 06
Social Sentiment → Revenue Intelligence Engine
NLP sentiment leading indicator correlating to sales performance
3wk
Lead Indicator
82%
Direction Accuracy
50K+
Posts Analyzed

Built an NLP pipeline that scrapes Reddit/Twitter brand mentions, applies VADER + transformer-based sentiment scoring, and correlates weekly sentiment shifts to product revenue movements. Discovered a consistent 3-week leading indicator: negative sentiment spikes predicted revenue decline with 82% directional accuracy — enabling marketing teams to respond proactively before impact hits the income statement.

PythonVADER NLPTextBlob PandasTableauSQLReddit API

Pipeline Architecture

  • Data collection: Reddit PRAW API + Twitter Academic API — 50K brand-mention posts per company
  • Dual-model sentiment: VADER for speed + FinBERT (finance-tuned BERT) for accuracy on financial terms
  • Time-lagged correlation analysis using cross-correlation function (CCF) to find 3-week lag signal

Marketing & Business Value

  • Framework applied to 5 consumer brands — 4 of 5 showed statistically significant (p < 0.05) lag correlation
  • Marketing response playbook: 3 intervention tiers triggered at different sentiment thresholds
  • Tableau storyboard presented to simulated CMO audience — recommended $200K early-warning monitoring budget
⌥ GitHub

Business Analysis Projects

6 strategic case studies combining process analysis, stakeholder management, requirements engineering, ROI modeling, and data-driven business transformation.

Business Analyst BA · 01
Digital Banking Transformation — Legacy System Migration
BRD, process mapping, stakeholder analysis & ROI modeling
$1.2M
Annual Savings
45%
Process Efficiency
18mo
Payback Period

Led business analysis for a regional bank's migration from a 15-year-old core banking platform to a cloud-native system. Produced comprehensive BRD (60-page), AS-IS/TO-BE process maps (22 workflows), stakeholder impact matrix, and phased implementation roadmap. Quantified ROI across 3 business units and secured C-suite sign-off through executive summary dashboard.

ExcelPower BIJIRA ConfluenceVisioSQL

BA Deliverables

  • Business Requirements Document (BRD): 60-page functional and non-functional requirements
  • Swim-lane process maps for 22 banking workflows — loan origination, account opening, payments
  • RACI matrix and stakeholder communication plan across 6 business units and IT department
  • Gap analysis matrix: 47 capability gaps identified, prioritized by business impact and effort

Business Case Outcome

  • 5-year NPV of migration: $4.2M based on conservative cost avoidance model
  • Process automation of 12 manual workflows → 45% reduction in processing time
  • Compliance risk exposure reduced by eliminating 3 end-of-life system vulnerabilities
⌥ GitHub
Business Analyst BA · 02
SaaS Pricing Model Optimization — B2B Platform
Cohort analysis, pricing tier modeling & revenue simulation
23%
MRR Uplift
4
Tier Scenarios
18mo
Cohort Window

Performed a full pricing analysis for a B2B SaaS platform with 3,500 customers. Conducted cohort-based LTV analysis, feature adoption heatmaps, and price elasticity modeling across customer segments. Built a revenue simulation model in Python + Excel that allowed leadership to test 4 pricing tier structures and visualize MRR trajectory under each scenario before committing to a pricing change.

PythonSQLExcel Power BIPandasCohort Analysis

Analytical Approach

  • Cohort analysis: 18 monthly cohorts — tracked expansion MRR, churn, and NRR by acquisition channel
  • Price elasticity estimated via natural experiment: A/B test on 500-customer pilot
  • Feature adoption matrix correlated to premium tier upgrade probability (85% accuracy)

Strategic Recommendations

  • Recommended 3-tier structure (Starter/Growth/Enterprise) over flat pricing — validated by willingness-to-pay survey
  • Freemium-to-paid conversion opportunity: 340 accounts showing Enterprise-tier behavior on Starter plan
  • Projected 23% MRR increase in Year 1, reaching breakeven on pricing change investment in 4 months
⌥ GitHub
Business Analyst BA · 03
Supply Chain Resilience Analysis — Post-COVID Recovery
Scenario modeling, vendor risk scoring & disruption quantification
87
Vendors Scored
3
Risk Scenarios
32%
Lead Time Reduced

Developed a vendor risk intelligence model for a mid-market manufacturer recovering from COVID supply chain disruption. Built a weighted vendor risk scorecard (87 suppliers) across 6 dimensions: financial health, geographic concentration, lead time variance, quality compliance, dependency risk, and ESG footprint. Ran Monte Carlo simulations for 3 disruption scenarios to quantify revenue exposure.

PythonMonte CarloSQL Power BIExcelRisk Modeling

Risk Framework

  • 6-dimension vendor scorecard: financial, geographic, lead time, quality, dependency, ESG (1-100 scale)
  • Monte Carlo simulation (10,000 iterations) to quantify revenue-at-risk under 3 disruption scenarios
  • Geographic heatmap: 62% of critical components sourced from single-region vendors — key vulnerability

Outcomes & Recommendations

  • Identified 14 "critical single-source" vendors accounting for $47M in annual procurement exposure
  • Dual-sourcing strategy for top-5 critical components modeled to reduce lead time risk by 32%
  • Executive scorecard dashboard now used in quarterly board risk reviews
⌥ GitHub
Business Analyst BA · 04
Customer Lifetime Value Segmentation — E-Commerce
RFM analysis, CLV prediction & marketing ROI optimization
34%
Marketing ROI
5
CLV Segments
200K+
Customers

Implemented a full RFM (Recency, Frequency, Monetary) segmentation model on a 200K+ customer e-commerce dataset. Combined RFM with a predictive CLV model (BG/NBD + Gamma-Gamma) to assign 12-month revenue forecasts to each customer. The resulting 5-tier segmentation strategy — each with tailored engagement recommendations — modeled a 34% improvement in marketing ROI by shifting budget from low-CLV to high-CLV acquisition channels.

Pythonlifetimes libSQL Power BIRFM AnalysisCLV Modeling

Modeling Approach

  • RFM scoring: recency deciles + frequency quintiles + monetary percentiles → 125-cell matrix collapsed to 5 tiers
  • BG/NBD model for purchase probability + Gamma-Gamma for spend prediction → 12mo CLV per customer
  • K-means validation confirmed 5 natural segments; silhouette score 0.64 (strong separation)

Marketing Recommendations

  • "Champions" segment (8% of customers, 41% of revenue): VIP loyalty program with exclusive access
  • "At-Risk High Value": win-back campaign with 20% discount modeled at 3.2x ROI vs. acquisition spend
  • Channel reallocation: shift $180K from paid search (low CLV acquisition) to email (5x CLV differential)
⌥ GitHub
⚡ WOW BA · 05
ESG Sustainability Scorecard & Regulatory Compliance Dashboard
SEC climate disclosure readiness with Power BI compliance tracking
3
ESG Pillars
40+
KPIs Tracked
SEC
Disclosure Ready

Designed a comprehensive ESG intelligence framework for a Fortune 500 manufacturing firm preparing for SEC climate disclosure requirements (2024). Built a data aggregation pipeline pulling Scope 1/2/3 emissions, water usage, DEI metrics, governance scores, and regulatory filings into a unified Power BI dashboard. Gap analysis identified 8 compliance risks before the mandatory reporting deadline.

PythonSQLPower BI ExcelGRI StandardsSASB Framework

Framework Design

  • Aligned to GRI Standards + SASB sector-specific metrics + TCFD climate risk disclosure framework
  • Scope 1/2/3 emissions data pipeline from 6 operational systems → unified emissions ledger
  • Real-time compliance gap tracker: 40 mandatory KPIs with RAG (red/amber/green) status indicators

Regulatory & Strategic Outcomes

  • 8 material compliance gaps identified: 3 critical (Scope 3 data completeness, water stress disclosures)
  • Carbon reduction roadmap built: 3 scenarios (BAU, moderate action, aggressive) with NPV of each
  • Board-level ESG presentation accepted as template for annual sustainability report
⌥ GitHub
⚡ WOW BA · 06
AI Automation ROI Analysis — Intelligent Process Automation
Process mining, bot performance simulation & $3.4M value identification
$3.4M
Value Identified
12
Processes Mapped
68%
Manual Hours Saved

Conducted a full-scale Intelligent Process Automation (IPA) opportunity assessment for a mid-market insurance company. Used process mining techniques to analyze 12 high-volume business processes (claims intake, policy renewal, billing reconciliation), scored each on automation feasibility, built ROI models for 3 implementation waves, and created an AI adoption roadmap with vendor selection scorecard comparing UiPath, Automation Anywhere, and Microsoft Power Automate.

PythonSQLPower BI ExcelProcess MiningROI Modeling

Process Assessment Methodology

  • 12 processes scored on 5-dimension feasibility matrix: volume, rule-based, digital input, exception rate, value
  • Process mining on event log data (100K+ transactions) to identify actual vs. ideal workflow deviation
  • 3 automation waves defined: Quick Wins (0-6mo), Strategic (6-18mo), Transformational (18mo+)

Business Case & Adoption

  • Wave 1 (4 processes): 14-month payback, $980K Year-1 savings, 2.3 FTE redeployment opportunity
  • Claims intake automation alone: 68% reduction in manual processing time → 4.2 days → 1.1 days SLA
  • Vendor recommendation: Microsoft Power Automate (best fit for existing M365 infrastructure)
⌥ GitHub

Investment Banking Projects

Institutional-quality M&A work product built to bulge-bracket standards — full-cycle deal execution from financial modeling and valuation to board-level advisory presentation.

Investment BankingIB · 01
P&U M&A Financial Model
NorthStar Energy Corp. Acquisition of GridPower Utilities — $12.1bn All-Cash Strategic Transaction
$12.1B
Enterprise Value
11
Model Sheets
575
Working Formulas

Full buy-side M&A financial model replicating bulge-bracket analyst work on a regulated utility acquisition. Covers DCF, trading comps (11 peers), precedent transactions (10 deals), merger accretion/dilution, sources & uses, and four sensitivity analysis tables — all cross-linked across 11 sheets.

ExcelDCFM&A ModelingTrading CompsMerger ModelPower & Utilities

Model Coverage

  • DCF with WACC sensitivity (6.5%–8.5%) and terminal growth rate matrix (2.0%–3.0%)
  • Trading comps: 11 regulated U.S. utility peers — NextEra, Duke, Southern Co., Dominion, Xcel
  • Precedent transactions: 10 utility M&A deals (2015–2026) with premium analysis
  • Full merger accretion/dilution at $0–$300mm synergy levels

Key Outputs

  • Implied share price: DCF $47–$58 | Comps $44–$61 | Precedents $49–$67
  • Offer premium: 19.5% to undisturbed price of $43.50
  • Pro forma Net Debt/EBITDA: 5.3x (regulatory max ≤5.5x)
  • Synergy NPV: ~$1.4bn pre-tax; entry multiple 12.0x EV/EBITDA
⎇ GitHub
Investment BankingIB · 02
IB Sell-Side Pitch Book
Project Northstar — GridPower Utilities Board Advisory Deck, 16-Slide Bulge-Bracket Presentation
16
Slides
$52.00
Offer Price
19.5%
Premium

Bulge-bracket quality sell-side M&A pitch book for GridPower's Board of Directors. Covers strategic rationale, financial highlights, football field valuation (6 methodologies), comparable company analysis, synergy waterfall, and full sale process timeline.

PowerPointFootball FieldSell-Side M&AValuationBoard PresentationPower & Utilities

Slide Coverage

  • Football field: 6 methodologies — 52-week range, analyst targets, DCF, trading comps, precedents, premiums paid
  • Comparable company table: 8 utility peers with EV/EBITDA, P/E, dividend yield
  • Synergy waterfall: $200mm across procurement, operations, G&A, financing
  • Process timeline: 5-phase (Oct 2025–Q4 2026), 14 buyers → 1 definitive agreement

Design System

  • Bulge-bracket theme: dark navy (#1F3864) / accent gold (#C9A84C)
  • Custom football field bar chart with $52.00 offer price reference line
  • Live Revenue/EBITDA bar chart and margin trend line (2021A–2027E)
  • 16:9 widescreen — Georgia headings, Calibri body text
⎇ GitHub

Get In Touch

Open to Data Analyst and Business Analyst roles in Finance, Healthcare, SaaS, Retail, and Logistics. Based in Jersey City, NJ — open to remote and hybrid.

Send a Message

Actively seeking new opportunities
Available for Data Analyst · Business Analyst · Analytics Engineer roles · Mid-level to Senior
View LinkedIn →