Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future.
Β
Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals.
Β
In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more.
The Voleon Group is growing its Feature Engineering team β a small, high-impact group responsible for turning the world's messy, complex datasets into predictive signals that power our machine learning models. As a Data Scientist on this team, you'll dig into raw data from diverse domains and, in collaboration with Research, assist in design and implementation of features that capture what is actually happening in the world. This is data storytelling at its most consequential: every feature you build has a direct path to our investment process.
We're looking for deeply curious people who get genuine satisfaction from wrestling with an unfamiliar dataset, understanding its structure and quirks, and emerging with something that encodes real information. You'll own the full arc from data sourcing and curation through feature construction, statistical validation, and integration into our production systems. We also expect you to actively experiment with AI-powered tools β LLM-based coding assistants, agentic workflows, and whatever comes next β to accelerate your day-to-day work and push the boundaries of what a small team can accomplish.
Responsibilities
β’ Explore, profile, and curate complex and often messy datasets from third-party vendors and internal sources, developing a deep understanding of what each dataset can and cannot tell us
β’ Harness financial intuition, academic research, and statistical rigor to inform design and implementation of predictive features in collaborative setting
β’ Validate features through a disciplined, test-driven framework β including cross-sectional analysis, stationarity testing, and point-in-time correctness β to ensure signals are real and not artifacts of data issues
β’ Build and maintain data pipelines that bring features from prototype to production, with monitoring for data health and correctness along the way
β’ Communicate your findings clearly β both the signal you've found and the story of how the data produces it β to researchers and leadership
β’ Proactively investigate anomalies in data feeds and production behavior, performing root-cause analysis and surfacing issues to relevant stakeholders
β’ Leverage AI tools to accelerate exploration, coding, and analysis β and share what you learn about effective workflows with the team
Requirements
β’ 2 years of applied industry experience (including internships) working end-to-end with complex datasets: curation, querying, aggregation, exploratory analysis, and visualization
β’ Experience using statistical methods to analyze data, identify patterns, conduct root-cause analysis, and translate findings into actionable insights
β’ Ability to frame and answer questions mathematically
β’ Ability to infer useful forward-looking directions from the results of retrospective analysis
β’ Fluency in managing, processing, and visualizing tabular data using SQL and Python (Pandas or Polars)
β’ Basic software development skills and experience with bash, Linux/Unix, and git
β’ Ability to refine ambiguous requests into well-scoped analyses and communicate results with clarity and precision
β’ Bachelor's degree in a quantitative discipline (statistics, data science, computer science, economics, physics, or a related field)
Preferred
β’ Master's degree in a quantitative discipline
β’ Prior industry experience or demonstrated interest in finance β academic projects, coursework in financial engineering, or industry internships
β’ Familiarity with financial datasets such as Compustat, IBES, or similar vendor data
β’ Experience developing in a production-facing environment with standard tooling (CI/CD, git, workflow orchestration)
β’ Hands-on experience with AI coding assistants or LLM-based tools in a data science or engineering workflow
β’ A track record of curiosity-driven exploration β side projects, Kaggle competitions, research papers, or anything that shows you can't leave an interesting dataset alone
Compensation
The base salary range for this position is $150,000 to $190,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation, such as bonus compensation and other benefits. Our benefits package includes medical, dental, and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match.
Β
βFriends of Voleonβ Candidate Referral Program
If you have a great candidate in mind for this role and would like to have the potential to earn $7,500 if your referred candidate is successfully hired and employed by The Voleon Group, please use thisΒ formΒ to submit your referral. For more details regarding eligibility, terms, and conditions, please make sure to review theΒ Voleon Referral Bonus Program.
Β
Equal Opportunity Employer
The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
Β
#LI-MD1