Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars

Snapmoji is an innovative avatar generation system that instantly creates animatable 3D avatars from user selfies by employing Gaussian Domain Adaptation for identity preservation and a novel dual-stylization process to apply secondary styles, all while overcoming the limitations of existing platforms through efficient mobile rendering.

Eric M. Chen, Di Liu, Sizhuo Ma, Michael Vasilkovsky, Bing Zhou, Qiang Gao, Wenzhou Wang, Jiahao Luo, Dimitris N. Metaxas, Vincent Sitzmann, Jian Wang

Published 2026-03-10
📖 4 min read☕ Coffee break read

Imagine you have a favorite cartoon character that looks exactly like you (like a Bitmoji). Now, imagine you could instantly turn that character into a LEGO figure, a skeleton, a yarn doll, or a toy, all while keeping your face recognizable. And the best part? You can make that toy skeleton blink, smile, and talk in real-time on your phone, all in less than a second.

That is Snapmoji.

Here is a simple breakdown of how this magic trick works, using everyday analogies:

1. The Problem: The "Toy Store" Limitation

Right now, apps like Snapchat or Apple let you build an avatar by picking from a pre-made menu: "Do you want blue eyes? A red hat? A beard?"

  • The Analogy: It's like going to a toy store where you can only buy the toys that are already on the shelf. If you want a "Spiderman" version of your avatar, the store doesn't have it. The artists have to hand-sculpt a new Spiderman toy from scratch, which takes weeks.
  • The Goal: Snapmoji wants to be a 3D printer that can instantly print any style you can imagine, without needing a factory full of artists.

2. The Solution: The Two-Step Magic Trick

Snapmoji does this in two lightning-fast steps:

Step 1: The "Identity Translator" (Gaussian Domain Adaptation)

First, the system takes your selfie and turns it into a standard cartoon avatar (the "Base Avatar").

  • The Analogy: Think of this as a universal translator. You speak "Real Human," and it instantly translates you into "Cartoon Language."
  • The Secret Sauce: They use a technique called Gaussian Domain Adaptation (GDA). Imagine a master artist who has studied millions of 3D objects (like a library of every toy ever made). Instead of just copying your face, this artist understands the structure of 3D space. This allows them to turn your photo into a cartoon in 0.08 seconds (faster than a blink) while keeping your unique features (like your specific eye color or glasses) perfectly intact.

Step 2: The "Style Chameleon" (Dual-Stylization)

Once you have your base cartoon, you type in a prompt like "Make me a Yoda" or "Make me a LEGO minifigure."

  • The Analogy: This is like putting a magic filter on your photo, but instead of just changing colors, it completely rebuilds the texture and shape.
  • How it works: It uses a powerful AI (a diffusion model) that acts like a creative director. It looks at your base cartoon and says, "Okay, let's turn the skin into plastic bricks for LEGO," or "Let's turn the hair into yarn." It does this in less than a second, ensuring you still look like you, just in a different costume.

3. Bringing It to Life: The "Puppet Master"

So far, we have a static picture. How do we make it move?

  • The Old Way: Most 3D avatars rely on complex math models designed for real human faces. If you try to put a cartoon head with giant eyes on that model, it breaks or looks weird.
  • The Snapmoji Way: They use a hybrid approach. They combine 3DMM (for the realistic parts of the face) with FACS (a system used by animators to control cartoon features like "squint eyes" or "open mouth").
  • The Result: It's like having a puppet master. When you smile on your phone camera, the system instantly maps your smile to the cartoon avatar. Because the math is so efficient, it runs smoothly on your phone at 30–40 frames per second (like a smooth video game), even in Augmented Reality (AR).

Why is this a big deal?

  1. Speed: It used to take artists days to make a new avatar style. Snapmoji does it in 0.9 seconds.
  2. Creativity: You aren't limited to a menu. You can be a "Cyberpunk Cat" or a "Medieval Knight" instantly.
  3. Real-time: You can use it in video calls or AR filters right now, without lagging your phone.

In a Nutshell

Snapmoji is like a magic mirror. You look into it, and it doesn't just show you a reflection; it shows you a LEGO version of yourself that can dance, talk, and react to your real-life expressions, all generated instantly by AI. It bridges the gap between the boring "menu of options" we have today and a world where your digital self can be anything you can imagine.