When did virtual try-on first work well enough to deploy commercially?

For accessories (glasses, jewelry), around 2018–2019 using AR. For apparel at photorealistic quality, 2022–2023 with generative AI diffusion models.

What made diffusion models better than earlier texture-mapping approaches?

Diffusion models generate the full image — fabric drape, lighting, and occlusion all emerge from the generation process. Texture mapping manually places 2D fabric over a body outline, which produces unrealistic edges and incorrect lighting.

Is footwear virtual try-on commercially viable in 2026?

Early commercial implementations exist, but render quality for footwear — particularly sole geometry and material reflection — is not yet at the threshold that apparel try-on has reached. Footwear try-on is likely 12–24 months behind apparel maturity.

What is Nano Banana 2?

Nano Banana 2 is the generative AI model Photta uses for virtual try-on. It is fine-tuned specifically on fashion and jewelry to handle apparel-specific rendering challenges like fabric drape, transparent materials, and metallic jewelry surfaces.

Will video try-on replace photo try-on?

Video try-on is in active development but not yet at commercial quality for apparel. Photo-based try-on will remain the dominant commercial format for at least the next 2–3 years while video models mature.

Guide · History

History of Virtual Try-On

Virtual try-on has a longer history than most people realize — the concept predates the web — but meaningful commercial viability for online fashion retail only arrived with the generative AI inflection of 2022.

Try Photta free See pricing

The quick read

Virtual try-on experiments began in the 1990s as mall kiosk installations — interesting proofs of concept, never commercially viable at scale.
The 2010s AR wave (driven by Snapchat's lens platform and mobile camera ubiquity) made accessories try-on viable but struggled with apparel drape.
The 2022 generative AI inflection — when diffusion models reached photorealistic quality for clothing — is when photo-based apparel try-on became a genuine commercial product.

1990s: mall kiosk experiments

The earliest documented virtual dressing room concepts appeared in retail research labs and academic papers in the early 1990s. The implementation typically involved a camera, a mirror-shaped display, and rule-based graphics overlays that could superimpose a simplified clothing silhouette on a video feed. IBM and several European retail groups ran limited pilot installations in department stores and mall kiosks between 1994 and 1999.

These systems were impractical for two reasons: the compute required for real-time video processing was expensive and physically large, and the graphics quality was far below the threshold where shoppers found the result believable. Adoption was uniformly low. The projects were useful as demonstrations that the concept could work in principle, but the technology was decades away from commercial viability.

2010s: the AR and mobile camera moment

The proliferation of smartphones with front-facing cameras and dedicated image signal processors created the first viable mass-market AR try-on platform. Snapchat's Lens Studio, launched in 2017, democratized face-AR creation and demonstrated that tens of millions of users would engage with real-time augmented reality experiences when the latency was low enough and the render quality was high enough.

Fashion and beauty brands moved quickly into this space. Sunglasses brands built lens-style try-ons for glasses. Cosmetics brands offered real-time lip color and foundation shade previews via AR. These applications worked well because they required tracking a relatively rigid surface — the face — which is a more tractable problem than tracking draped fabric on a moving body. By 2019, AR try-on was a proven commercial tool for accessories and beauty, but still largely unproven for apparel.

2018–2020: first-generation e-commerce try-on

The first wave of e-commerce virtual try-on products — targeting apparel specifically — launched between 2018 and 2020. These products typically used a combination of body pose estimation (estimating the 3D position of body joints from a 2D image) and texture mapping to drape a 2D garment texture onto a detected body silhouette. The results were technically impressive but visually unconvincing: fabric edges were poorly defined, lighting was inconsistent, and complex garments like layered outerwear or flowing dresses produced artifacts.

Commercial adoption was limited. Several well-funded startups in this space either pivoted to B2B catalog photography or closed between 2020 and 2022. The fundamental problem was not compute or engineering effort — substantial capital was deployed on both — but model architecture: texture-mapping approaches could not realistically simulate how fabric drapes, folds, and interacts with body geometry.

2022: the generative AI inflection

The release of latent diffusion models with sufficient resolution and control mechanisms — the technical foundation of image generation systems that emerged prominently in 2022 — changed what was possible for virtual try-on in a fundamental way. Instead of texture-mapping a garment onto a body, diffusion-based models could generate a photorealistic image of a person wearing a garment, conditioned on both the person's photo and the garment's appearance. The fabric drape, lighting interaction, and body occlusion all emerged from the generation process rather than from explicit simulation.

This architectural shift is what made photo-based apparel try-on a commercial product. Photta launched its B2B widget powered by Nano Banana 2, a fine-tuned diffusion model optimized for fashion and jewelry applications, as part of this generative AI era. The render quality crossed the threshold that drives actual commercial outcomes: shoppers found the results believable enough to make purchase decisions based on them, as evidenced by the conversion and return-rate data from Photta's merchant cohort.

2026: where the technology stands

As of 2026, generative AI-based virtual try-on is a mature commercial product for apparel and jewelry. The technology delivers consistent photorealistic results at acceptable latency (8–15 seconds), scales to catalogs of any size without per-SKU production overhead, and has accumulated enough merchant deployment data to support reliable ROI benchmarks. The question for a fashion merchant in 2026 is not 'does this technology work?' but 'which implementation fits my catalog and traffic level?'

Adjacent applications remain earlier in their development cycle. Footwear try-on presents specific challenges around foot geometry and sole rendering that apparel diffusion models do not address well. Video-format try-on — generating a short clip rather than a static image — is in active development but not yet at the render quality threshold for commercial deployment at scale. Multi-garment outfit composition (try on a top, bottom, and accessory simultaneously) is an active research area with early commercial implementations beginning to appear in 2025–2026.

Built on the 2022 generative AI breakthrough

🤖

Nano Banana 2 model

Fine-tuned diffusion model for fashion and jewelry. Photorealistic drape, lighting, and silhouette — not texture mapping.

⚡

8–15 second renders

Latency that shoppers accept. Fast enough to use in a real purchase session without abandonment.

📸

Any catalog, any size

No per-SKU 3D production. The AI reads your existing 2D product images at try-on time.

📈

Proven merchant outcomes

Sufficient cohort data to support reliable benchmarks: 18–28% conversion lift, 25–30% return reduction.

FAQ

: For accessories (glasses, jewelry), around 2018–2019 using AR. For apparel at photorealistic quality, 2022–2023 with generative AI diffusion models.

Try Photta free for 14 days

Three pricing tiers from $49/mo. No credit card required to start.

View plans

Deploy 2026-era try-on technology today

Generative AI. Photorealistic results. 14 days free.

Start free trial