From Logs to Agents: Reconstructing High-Level Creative Workflows from Low-Level Raw System Traces

Imagine you are watching a master chef cook a complex dish. You want to understand their creative process: Why did they add salt? Why did they switch from a knife to a whisk? How did they decide to change the recipe halfway through?

Now, imagine the only way you can see what happened is by looking at a massive, chaotic spreadsheet of the chef's hand movements. It lists every single muscle twitch, every time they blinked, every time they adjusted their grip, and every time the kitchen timer beeped.

This is the problem with current AI design tools.

The paper "From Logs to Agents" by Tae Hee Jo and Kyung Hoon Hyun argues that while we have tons of data about how people use creative AI tools (like node-based generators), it's all "noise." It's like that messy spreadsheet. It tells us that a button was clicked, but not why or what the creative idea was.

Here is the paper's solution, broken down into simple concepts:

1. The Problem: The "Black Box" of Creativity

Current AI tools are like magic boxes. You put a prompt in, and an image comes out. But inside, the computer is doing thousands of tiny, invisible things (connecting wires, cleaning up memory, updating settings).

The Analogy: If you try to understand a movie by looking at the raw electrical signals flickering through the projector bulb, you'll miss the story. You need the movie, not the electricity.
The Issue: Researchers and AI agents (smart assistants) can't understand the user's "creative intent" because the data is too messy and low-level.

2. The Solution: Turning Noise into a Story

The authors built a "translator" pipeline that turns those messy raw logs into a clear, high-level story of the design process. They do this in three steps:

Step A: The Noise Filter (Cleaning the Kitchen)

First, they ignore the boring stuff. They filter out the "system noise"—like the computer cleaning up temporary files or syncing data.

The Analogy: Imagine a detective ignoring the sound of the wind and the ticking of a clock to focus only on the footsteps and the voice of the suspect.
The Result: They strip away 40% of the data, leaving only the "real" creative moves: Inserting a new idea, Modifying an existing one, Generating a new result, or Removing something that didn't work.

Step B: The Family Tree (Visualizing the Journey)

Creativity isn't a straight line; it's a tree. You might try a blue sky, then branch off to try a purple sky, decide you hate it, and go back to the blue sky to make it brighter.

The Analogy: Think of a "Choose Your Own Adventure" book. The authors build a map (a graph) that shows every branch the user took.
The Result: Instead of a long list of timestamps, they create a visual map showing how one idea led to another, revealing where the user was "exploring" (trying many branches) and where they were "refining" (deepening one branch).

Step C: The Secret Code (Tokenization)

Finally, they turn these visual steps into a simple language (tokens) that computers can easily read.

The Analogy: Instead of saying "The user clicked the red button on the left panel to change the brightness of the image," they just say: MODIFY_IMAGE.
The Result: This creates a universal language. It doesn't matter what specific software the user used; the AI can now understand the action.

3. Why This Matters: From "Dumb" Tools to "Smart" Partners

The ultimate goal of this research is to build Process-Aware Agents.

The Old Way (Passive Tool): You ask the AI, "Make this image brighter." The AI does it. It has no memory of what you did five minutes ago. It's like a waiter who only hears your current order and forgets everything else.
The New Way (Smart Partner): Because the AI now has the "Family Tree" of your workflow, it can predict what you need next.
- The Analogy: Imagine a sous-chef who watches you chop onions, then realize you've done it five times in a row. Instead of waiting for you to ask, the sous-chef says, "I see you're prepping a lot of onions. Should I start chopping the next batch for you?"
- Real Example: If the AI sees you are constantly resizing images after importing them, it might say, "I notice you resize every image you import. Would you like me to do that automatically next time?"

The Big Takeaway

This paper provides the missing link between raw data and human understanding. By cleaning up the messy logs and turning them into a clear story of how a human thinks and creates, we can teach AI agents to stop just "executing commands" and start "collaborating" with us.

It's the difference between a robot that just follows orders and a creative partner who understands your journey.

From Logs to Agents: Reconstructing High-Level Creative Workflows from Low-Level Raw System Traces

1. The Problem: The "Black Box" of Creativity

2. The Solution: Turning Noise into a Story

Step A: The Noise Filter (Cleaning the Kitchen)

Step B: The Family Tree (Visualizing the Journey)

Step C: The Secret Code (Tokenization)

3. Why This Matters: From "Dumb" Tools to "Smart" Partners

The Big Takeaway

1. Problem Statement

2. Methodology

Stage 1: Semantic Filtering and Classification (De-noising)

Stage 2: Design Sequence Reconstruction (Workflow Visualization)

Stage 3: Tokenization

3. Key Results and Analysis

4. Key Contributions

5. Significance and Future Implications

From Logs to Agents: Reconstructing High-Level Creative Workflows from Low-Level Raw System Traces

1. The Problem: The "Black Box" of Creativity

2. The Solution: Turning Noise into a Story

Step A: The Noise Filter (Cleaning the Kitchen)

Step B: The Family Tree (Visualizing the Journey)

Step C: The Secret Code (Tokenization)

3. Why This Matters: From "Dumb" Tools to "Smart" Partners

The Big Takeaway

1. Problem Statement

2. Methodology

Stage 1: Semantic Filtering and Classification (De-noising)

Stage 2: Design Sequence Reconstruction (Workflow Visualization)

Stage 3: Tokenization

3. Key Results and Analysis

4. Key Contributions

5. Significance and Future Implications

More like this

Monotone Comparative Statics without Lattices

Motion Illusions Generated Using Predictive Neural Networks Also Fool Humans

Performance Analysis of IEEE 802.11p Preamble Insertion in C-V2X Sidelink Signals for Co-Channel Coexistence

Construction of time-varying ISS-Lyapunov Functions for Impulsive Systems

Real-Time BDI Agents: a model and its implementation