Breaking into baseball analytics requires a combination of technical skills, self-directed research projects, and strategic networking. There is no single path, but the people who land jobs in MLB front offices almost always share three things: strong programming ability, a portfolio of original analysis, and a deep understanding of modern baseball metrics. Here’s how to build each of those from scratch.
Learn the Right Technical Skills
Two programming languages dominate baseball analytics: R and Python. R is the go-to for statistical analysis, data visualization, and reporting. Python covers a broader range, from data cleaning to machine learning and more complex engineering tasks. You don’t need to master both immediately, but you should be comfortable in at least one before applying anywhere. SQL is the other essential tool, since nearly every team stores its data in relational databases and expects analysts to query them directly.
Beyond coding, you need a genuine foundation in statistics. Not just “I took a stats class” but the ability to build regression models, understand probability distributions, and think critically about sample size. If someone hands you a pitcher’s spin rate data from April and asks whether it predicts second-half performance, you should know how to approach that question rigorously.
Free resources are everywhere. Python and R both have dedicated baseball analysis libraries and tutorials. Start with a structured online course in your chosen language, then immediately begin applying what you learn to real baseball data.
Where to Find Free Baseball Data
You don’t need a subscription or an MLB connection to start doing real analysis. Several public sources provide professional-grade data.
- Baseball Savant is the primary public source for Statcast data. Full Statcast datasets aren’t publicly available, but Baseball Savant lets you access batted ball variables like exit velocity, launch angle, and sprint speed for every play going back to 2015.
- FanGraphs offers comprehensive historical statistics, leaderboards, and advanced metrics for both hitters and pitchers.
- Retrosheet provides play-by-play data going back decades, useful for historical analysis and building large datasets.
- Lahman Database is a freely downloadable database covering major league statistics from 1871 to the present, ideal for practicing SQL queries.
The pybaseball library in Python and the baseballr package in R both allow you to pull data directly from these sources into your coding environment with just a few lines of code.
Understand Modern Metrics
If you want to work in baseball analytics, you need fluency in the metrics teams actually use. The field has moved well beyond batting average and ERA.
Expected Weighted On-base Average (xwOBA) is one of the most important modern metrics. It takes every batted ball a hitter produces and assigns expected outcomes based on exit velocity and launch angle, using the results of comparable batted balls across the Statcast era. For weakly hit balls, a batter’s sprint speed is also factored in. The key insight is that xwOBA removes defensive influence entirely. A line drive hit right at a fielder and a line drive that drops for a hit look the same in xwOBA, because the hitter’s quality of contact was identical. This makes it more reflective of true skill than traditional stats. Marcell Ozuna’s 2018 season illustrates this well: his actual wOBA was .327, but his xwOBA was .359, suggesting he was unlucky on batted balls he hit hard.
Other metrics worth learning inside and out include Stuff+ (which grades pitch quality based on movement, velocity, and release point), run expectancy matrices, pitch tunneling concepts, and sprint speed applications. Read the Statcast glossary on MLB.com until these feel like second nature.
Build a Portfolio That Shows Your Thinking
A portfolio of original projects is the single most important thing you can create. It matters more than your degree, more than your GPA, and often more than your work experience. Teams want to see how you think through a baseball problem, not just that you can code.
Strong portfolio projects go beyond describing what happened and instead model what should have happened or what will happen next. Some examples of the kind of work that stands out: building a model to predict whether an outfielder can throw out a runner at home plate based on player positioning and game situation, evaluating whether a third base coach made the right decision to send or hold a runner using play-by-play data, or creating a metric that grades pitch sequencing effectiveness. These projects show you can frame a baseball question, gather relevant data, build a model, and communicate the results clearly.
Visualizations matter too. A clean chart that communicates a finding at a glance demonstrates a skill teams value highly, since analysts frequently present to coaches and executives who don’t read code. Host your projects on GitHub and write up your methodology in plain language alongside the code.
Publish Your Work Where People Will See It
FanGraphs operates a community blog where anyone can submit original baseball analysis. Posts go through an approval process that typically takes up to 48 hours. The editorial team selects the best submissions to appear on the community blog, and exceptionally strong pieces get featured on the homepage. Submissions need to be at least 250 words, well-written, factually correct, and make creative use of statistics to support an original point. Getting a piece published on FanGraphs is a genuine credential in this space, and hiring managers notice it.
Beyond FanGraphs, consider starting your own blog or posting analysis threads on social media platforms where the baseball analytics community is active. Consistency matters. Publishing one strong piece per month for a year builds a body of work that tells a hiring manager you’re serious and self-motivated.
What to Study in School
The overwhelming consensus among people working in baseball operations is to avoid sports management programs and instead pursue something quantitative. Statistics is the most commonly recommended major, followed closely by computer science, data science, applied mathematics, and economics. Any of these give you transferable skills that are valuable both inside and outside baseball, which matters because front office jobs are scarce and competitive.
A graduate degree isn’t required but can help, particularly a master’s in statistics or data science for analyst roles, or an MBA if you’re targeting the business and strategy side of baseball operations. Several universities now offer sports analytics concentrations or certificate programs that provide structured coursework and capstone projects with real teams.
Attend Conferences and Network
Several annual conferences serve as gathering points for the baseball analytics community. The SABR Analytics Conference and the MIT Sloan Sports Analytics Conference are the two biggest names. The Carnegie Mellon Sports Analytics Conference, the Midwest Sports Analytics Meeting, and the UConn Sports Analytics Symposium are smaller but still valuable, and some waive registration fees for presenters. Submitting a research paper or poster to one of these events is a strong resume item and puts you in direct contact with people who hire for MLB teams.
Networking doesn’t require a conference badge, though. Following and engaging with analysts on social media, responding thoughtfully to published research, and sharing your own work builds relationships over time. Many current MLB analysts got their start by being visible and active in the online sabermetrics community.
Landing Your First Role
Most people enter MLB analytics through internships or seasonal positions rather than full-time hires. Teams and MLB itself post internship openings with application deadlines that often fall in November or December for the following summer. The league’s own Baseball Operations internship, for example, has used a mid-December deadline. These timelines mean you should have your portfolio, resume, and cover letter ready by early fall.
Entry-level analyst salaries in MLB are modest compared to what the same skills command in tech or finance. An analyst at Major League Baseball earns roughly $58,000 per year, while senior analysts make around $74,000. Data scientist roles pay in the range of $105,000, and senior data scientists around $120,000. These numbers are competitive with the national average for data work but significantly lower than what companies like Google or Goldman Sachs offer for similar technical profiles. Most people in baseball analytics are there because they love the game, and they accept the tradeoff knowingly.
Because the industry is small (30 teams, limited front office budgets), rejection is normal. Many successful analysts applied to dozens of positions over multiple years before getting their first opportunity. The differentiator is almost always the quality and originality of their independent work.

