FactorEngine: A Program-level Knowledge-Infused Factor Mining Framework for Quantitative Investment

Imagine you are trying to find the perfect recipe for a dish that will make you rich, but the ingredients (the stock market) change flavor every single day. Sometimes it's spicy, sometimes bland, and sometimes it's toxic.

For decades, investors have tried to find these "magic recipes" (called Alpha Factors) using two main methods:

The Manual Chefs: Humans write strict rules (like "if the price drops, buy"). This is slow and hard to scale.
The Black Box AI: Neural networks guess the recipe by tasting millions of dishes. They are good at guessing, but you have no idea why they chose that recipe, and they often fail when the ingredients change.

FactorEngine (FE) is a new, third way. Think of it as a Super-Chef Robot that doesn't just guess or follow strict rules. Instead, it writes its own cooking code, learns from its mistakes, and constantly improves its recipes.

Here is how FactorEngine works, broken down into simple analogies:

1. The "Bootstrapping" Phase: Reading the Cookbook

Before the robot starts cooking, it needs inspiration.

The Old Way: Humans read financial reports and try to guess what the robot should do.
The FactorEngine Way: The robot has a team of AI Agents that read thousands of financial reports. They don't just summarize them; they extract the core idea (e.g., "When volume spikes and price jumps, the stock is likely to keep going up") and immediately turn that idea into executable computer code.
The Analogy: Imagine a team of translators who read a chef's handwritten notes and instantly turn them into a working robot program. This gives the robot a massive head start with "knowledge-infused" recipes.

2. The "Evolution" Phase: The Macro-Micro Dance

This is the heart of the system. FactorEngine splits the work into two distinct teams to avoid getting stuck or wasting time.

The Macro Team (The Creative Architect):
- Who: Large Language Models (LLMs), like the smartest chefs in the world.
- Job: They look at the big picture. They ask, "What if we change the logic? What if we look at the stock's behavior over 3 days instead of 1?" They use a "Chain of Experience" (a memory of past successes and failures) to guide their ideas.
- Analogy: This is the architect redesigning the kitchen layout. They decide what to cook and how to approach the problem.
The Micro Team (The Precision Tuner):
- Who: A fast, automated math engine (Bayesian Optimization).
- Job: Once the Architect proposes a new recipe, the Tuner fine-tunes the numbers. "Should the window be 3 days or 3.5 days? Should the decay factor be 0.9 or 0.95?"
- Analogy: This is the sous-chef adjusting the salt and pepper. They don't change the recipe; they just make sure the measurements are perfect.

Why separate them?
If you ask the Creative Architect to also count the salt, they get distracted and slow down. By separating the "Big Ideas" (LLM) from the "Fine Tuning" (Math), FactorEngine runs much faster and smarter.

3. The "Island" Strategy: Avoiding Groupthink

In nature, if a species lives on one island, they might all evolve the same way and die out if the environment changes.

FactorEngine's Solution: It runs multiple independent "Islands" of evolution at the same time.
The Migration: Every few rounds, the best recipes from one island are sent to the others.
The Analogy: Imagine five different cooking competitions happening simultaneously. Every week, the winner of Competition A sends their best dish to Competition B. This ensures that a brilliant idea discovered in one group spreads to everyone, preventing the system from getting stuck in a "local optimum" (a good but not great solution).

4. The "Verification" Loop: Learning from Failure

Most AI systems only learn from what works. FactorEngine is special because it learns from failures.

If a recipe fails (the code crashes or loses money), the system records why it failed.
The "Chain of Experience" remembers these failures so the Architect doesn't make the same mistake twice.
Analogy: It's like a student who keeps a diary of every wrong answer on a test, not just the right ones, so they never miss that specific question again.

The Results: Why It Matters

When tested against the best existing methods (like human-made factors or other AI agents):

Better Predictions: It found signals that were more accurate (higher IC).
More Profit: It generated significantly higher returns (up to 126% more than some baselines).
More Stable: It didn't crash as hard when the market turned (lower drawdown).
Diverse: It found a wider variety of unique strategies, rather than just copying the same idea over and over.

Summary

FactorEngine is like a self-improving, multi-agent factory for investment strategies.

It reads human knowledge and turns it into code.
It separates "Big Creative Ideas" from "Mathematical Tweaking" to be super efficient.
It runs parallel experiments to ensure diversity.
It learns from its mistakes to build better, more robust strategies.

Instead of just guessing the market or following rigid rules, FactorEngine writes its own adaptable code, making it a powerful tool for navigating the chaotic, noisy world of finance.

FactorEngine: A Program-level Knowledge-Infused Factor Mining Framework for Quantitative Investment

1. The "Bootstrapping" Phase: Reading the Cookbook

2. The "Evolution" Phase: The Macro-Micro Dance

3. The "Island" Strategy: Avoiding Groupthink

4. The "Verification" Loop: Learning from Failure

The Results: Why It Matters

Summary

1. Problem Statement

2. Methodology: FactorEngine (FE)

Core Architecture

3. Key Contributions

4. Experimental Results

5. Significance

FactorEngine: A Program-level Knowledge-Infused Factor Mining Framework for Quantitative Investment

1. The "Bootstrapping" Phase: Reading the Cookbook

2. The "Evolution" Phase: The Macro-Micro Dance

3. The "Island" Strategy: Avoiding Groupthink

4. The "Verification" Loop: Learning from Failure

The Results: Why It Matters

Summary

1. Problem Statement

2. Methodology: FactorEngine (FE)

Core Architecture

3. Key Contributions

4. Experimental Results

5. Significance

More like this

Exploration and Exploitation Errors Are Measurable for Language Model Agents

SciFi: A Safe, Lightweight, User-Friendly, and Fully Autonomous Agentic AI Workflow for Scientific Applications

Numerical Instability and Chaos: Quantifying the Unpredictability of Large Language Models

Optimizing Earth Observation Satellite Schedules under Unknown Operational Constraints: An Active Constraint Acquisition Approach

WebXSkill: Skill Learning for Autonomous Web Agents