AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text

Forward LBS vs. Inverse LBS

We ablate on the design choice of forward LBS and inverse LBS. We compare the animation results of models trained with forward LBS (left) and with inverse LBS (right) strategies. When using forward LBS training, certain floating artifacts can be observed around the hip and shoulder regions during the animation process.

Forward LBS

Inverse LBS

Forward LBS

Inverse LBS

Captain America

Iron Man

Hulk, Marvel Character

Policewoman

Comparison with different condtions

We compare avatar results with different depth / normal conditions.

Depth condition

Normal condition

DensePose condition

Harry Potter

a Spanish flamenco dancer

xxx

Albert Einstein

Image quality

We update avatar results without prefixes such as "8K, HD, studio-like, blender, ultra-realistic, unreal" .

a Spanish flamenco dancer

Kim Kardashian

a woman wearing ski clothes

After fixing this bug, we reduce the "floater" artifacts, producing cleaner and artifact-free outputs. .

Chibi, single boy, cute, magician's outfit, top hat, magic wand, curly hair, shiny shoes

Chibi, 1girl, hanfu, cat ears, cat girl, silk robe, wavy hair, wearing traditional sandals

The original video results of text-driven animation. When the character moves away from the camera ("zoom-out"), the character's body only occupies a smaller pixel area, which can reduce visual clarity.

A pregnant person of color

Comparison Results

We compare AvatarStudio with current SoTA methods En3D, HumanGaussian, Rodin.

HumanGaussian

En3D

Rodin

Ours

a woman wearing ski clothes

a Texas ranger

More examples for Ablation Studies

We provide more examples of CFG-rescale and part super-resolution. The additional examples clearly illustrate how Part-aware SR contributes to detail refinement in specific body parts, enhancing overall visual fidelity. Moreover, we conduct further tests for CFG-rescale across several examples. These tests demonstrate that CFG-rescale effectively mitigates over-saturation issues, resulting in more natural and visually pleasing outputs.

without part-aware super-resolution

with part-aware super-resolution

without part-aware super-resolution

with part-aware super-resolution

Doctor Strange, Marvel Character

Hua mulan

a karate master with black belt

Terracotta

without CFG-rescale

with CFG-rescale

without CFG-rescale

with CFG-rescale

Doctor Strange, Marvel Character

Elsa in Forzen

Hua mulan

Terracotta

Citation


                    @article{anon2023avatarstudio,

                      author = {Anonymous},

                      title  = {AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text},

                      joural = {OpenReview},

                      year   = {2023},

                }