

































































































































































































































































































































































































































































Open-source AI models ranked for adult content generation. Scores based on output quality, NSFW accuracy, speed, and creative range.
80 models ranked · We earn commissions from some links. Disclosure
80 models
| # | Model | Type | Quality | Speed | NSFW Fidelity | Versatility | Overall | |
|---|---|---|---|---|---|---|---|---|
| 1 | Black Forest Labs · Flux 2 · 12B The next generation of FLUX raises the ceiling for open-weights image generation. Prompt adherence and coherence jump from great to borderline unfair. | Image | 8.3 | Try it | ||||
| 2 | Maya Research · Maya · 3B Open-weights TTS with 20+ emotion controls. The ability to dial in breathy, whispered, or intense delivery makes it a natural fit for companion voice. | Voice | 8.3 | |||||
| 3 | Black Forest Labs · Flux · 12B The benchmark everyone chases. Prompt adherence and anatomy are absurdly good for an open model. | Image | 8.0 | Try it | ||||
| 4 | Black Forest Labs · Flux · 12B Four steps and you're done. Quality takes a hit, but nothing else touches this speed. | Image | 8.0 | Try it | ||||
| 5 | RunDiffusion · SDXL fine-tune · 6.6B Cinematic lighting out of the box. Handles full-body compositions without the usual SDXL hand disasters. | Image | 8.0 | Try it | ||||
| 6 | xAI · Grok xAI's frontier model with the loosest content policy among the majors. Will write explicit scenes without much pushing, though it still has some topic limits. | Text | 8.0 | Try it | ||||
| 7 | Sao10K · LLaMA 3 · 8B Sao10K's 8B entry built explicitly for NSFW. Runs anywhere, refuses nothing, and writes better explicit content than models three times its size. | Text | 8.0 | Try it | ||||
| 8 | Kokoro · Kokoro · 82M Tiny model that outperforms its weight class. 82M parameters running on CPU with quality that rivals cloud TTS. Open weights, no content filter, absurdly cheap. | Voice | 8.0 | |||||
| 9 | PurpleSmart · SDXL fine-tune · 6.6B The NSFW anime checkpoint that rewired Civitai. Tag-based prompting gives you surgical control over poses and anatomy. | Image | 7.8 | Try it | ||||
| 10 | SG161222 · SDXL fine-tune · 6.6B Photorealism that fools people. Skin texture and lighting are a cut above anything else on SDXL. | Image | 7.8 | Try it | ||||
| 11 | ArliAI · GLM MoE · 355B Community derestriction of GLM-4.6 that tops the UGI rankings. Massive MoE architecture delivers frontier quality with zero content filters. | Text | 7.8 | |||||
| 12 | Gryphe · LLaMA 2 · 13B The OG uncensored roleplay model that proved small models can write. Runs on a potato and still delivers surprisingly coherent NSFW scenes. | Text | 7.8 | Try it | ||||
| 13 | Stability AI · SDXL · 6.6B The workhorse that launched a thousand fine-tunes. Still the widest LoRA ecosystem by far. | Image | 7.5 | Try it | ||||
| 14 | OnomaAI · SDXL fine-tune · 6.6B Built for anime and illustration with a massive tag vocabulary. The Danbooru training data shows in the best way. | Image | 7.5 | Try it | ||||
| 15 | Lax · SDXL fine-tune · 6.6B Trained explicitly for NSFW with zero safety filters baked in. Anatomy accuracy for explicit poses is unmatched in the SDXL family. | Image | 7.5 | Try it | ||||
| 16 | Alibaba Cloud · Qwen Image Budget king from Alibaba. Quality punches near FLUX.2 at a fraction of the cost, and open weights mean you can strip the safety filters yourself. | Image | 7.5 | Try it | ||||
| 17 | Alibaba · Wan · 14B The open-source video model that actually competes with closed APIs. Motion coherence over 5+ seconds is wild. | Video | 7.5 | Try it | ||||
| 18 | Lightricks · LTX-Video 2 Major upgrade over LTX-Video with better temporal consistency. Open weights and Apache license make it the fastest path to uncensored video generation. | Video | 7.5 | Try it | ||||
| 19 | NeverSleep · LLaMA 3.1 · 8B Punches above its weight class for 8B parameters. Runs on consumer GPUs and still writes convincing scenes. | Text | 7.5 | Try it | ||||
| 20 | DeepSeek · DeepSeek MoE · 671B DeepSeek's latest MoE giant with open weights. General intelligence rivals GPT-4 class models, and the open nature means no hard content blocks. | Text | 7.5 | |||||
| 21 | Sao10K · LLaMA 3.1 · 70B The community NSFW roleplay standard. Sao10K's fine-tune delivers rich, flowing prose and handles complex multi-character scenes without losing the thread. | Text | 7.5 | Try it | ||||
| 22 | anthracite-org · Qwen 2 · 72B Built on Qwen2 with training data curated for creative fiction. Writes with a rhythm that feels authored, not generated. Handles kink negotiation in character. | Text | 7.5 | Try it | ||||
| 23 | Mistral AI · Mistral · 12B Mistral's 12B sweet spot balances quality and speed. The base model has mild guardrails that community fine-tunes strip out easily. | Text | 7.5 | Try it | ||||
| 24 | Fish Audio · Fish Speech · 500M Low-latency voice cloning that handles breathy and expressive tones well. The open-source voice model to watch. | Voice | 7.5 | Try it | ||||
| 25 | StepFun · Step Audio · 3B Emotion and style controls baked into the architecture. You can shape tone without prompt hacking, and open weights mean no guardrails on what you generate. | Voice | 7.5 | |||||
| 26 | Nari Labs · Dia · 1.6B Generates dialogue between multiple speakers from a script. Natural turn-taking and emotion tags make it interesting for AI companion voice scenarios. | Voice | 7.5 | |||||
| 27 | Community Merge · LLaMA · 70B The 70B merge that keeps the uncensored roleplay community fed. Prose quality and character consistency are hard to beat at this size. | Text | 7.3 | Try it | ||||
| 28 | NeverSleep · LLaMA 2 · 20B Built for NSFW chat with zero guardrails. Keeps character voice consistent across long sessions without drifting. | Text | 7.3 | Try it | ||||
| 29 | TheDrummer · Mistral · 123B Purpose-built Mistral merge for unrestricted creative writing. Prose quality at 123B is a real step up from the 70B class, and it refuses nothing. | Text | 7.3 | |||||
| 30 | darkc0de · Mistral · 123B The UGI #4 open-weights model. Named like malware, writes like a bestselling author. Zero refusals and strong long-context performance. | Text | 7.3 | |||||
| 31 | DeepSeek · DeepSeek MoE · 671B The 671B MoE model that shocked the industry with GPT-4 class performance at open-weights pricing. Mild content filters in the official API, but self-hosted removes them. | Text | 7.3 | |||||
| 32 | KaraKaraWitch · LLaMA 3.3 · 70B Community merge on Llama 3.3 that scores high on both general intelligence and willingness. Prose quality is polished for a merge. | Text | 7.3 | Try it | ||||
| 33 | ElevenLabs · ElevenLabs The voice quality benchmark everything else gets measured against. 70+ languages, incredible expressiveness, but content policy blocks explicit audio. | Voice | 7.3 | Try it | ||||
| 34 | ElevenLabs · ElevenLabs ~75ms latency makes this the speed king. Quality trades off slightly for real-time responsiveness, perfect for live AI companion voice. | Voice | 7.3 | Try it | ||||
| 35 | Columbia University · StyleTTS Diffusion-based TTS that produces remarkably natural prosody. Academic origin means no commercial agenda and no content filtering. | Voice | 7.3 | |||||
| 36 | Sesame · CSM · 1B Context-sensitive speech model that maintains voice character across long conversations. Designed for sustained dialogue, not just isolated utterances. | Voice | 7.3 | |||||
| 37 | Cagliostro Research · SDXL fine-tune · 6.6B Clean linework and consistent character faces. Better at SFW anime than explicit content, but it handles both. | Image | 7.0 | Try it | ||||
| 38 | TareksGraveyard · LLaMA · 70B Merge focused on stylistic prose with a slant toward dramatic, literary NSFW writing. Less casual than MythoMax, more novel than chatbot. | Text | 7.0 | Try it | ||||
| 39 | Coqui · XTTS Voice cloning TTS that supports 17 languages. Coqui shut down, but the model lives on. Still a solid choice for self-hosted voice cloning with no restrictions. | Voice | 7.0 | |||||
| 40 | Stability AI · SD3.5 · 8B Strong prompt following and text rendering, but the NSFW fine-tune scene hasn't caught up yet. | Image | 6.8 | Try it | ||||
| 41 | Black Forest Labs · Flux 2 · 12B Commercial FLUX.2 variant with safety filters baked in. Extraordinary quality but locked down for adult content. | Image | 6.8 | Try it | ||||
| 42 | Tencent · HunyuanVideo · 13B Cinematic motion quality with good temporal consistency. NSFW fine-tunes are still early but promising. | Video | 6.8 | Try it | ||||
| 43 | Lightricks · LTX-Video · 2B Lightweight and fast for a video model. Quality trails the heavyweights, but iteration speed makes up for it. | Video | 6.8 | Try it | ||||
| 44 | Tencent · HunyuanVideo · 13B Incremental upgrade that smooths out motion artifacts from the original. NSFW fine-tunes are gaining traction in the community. | Video | 6.8 | Try it | ||||
| 45 | Alibaba · Wan · 14B Incremental open-weights update to Wan 2.1. Community NSFW fine-tunes carry over from the original, and quality got a small bump. | Video | 6.8 | |||||
| 46 | Meta · LLaMA 3.1 · 70B Meta's workhorse that spawns most of the best NSFW fine-tunes. The base model resists explicit content, but the architecture is the foundation of the uncensored LLM ecosystem. | Text | 6.8 | Try it | ||||
| 47 | Alibaba · Qwen 2.5 · 72B Alibaba's challenger that matches Llama 3.1 70B on benchmarks while being more permissive out of the box. Increasingly popular as a base for uncensored merges. | Text | 6.8 | Try it | ||||
| 48 | ElevenLabs · ElevenLabs 29-language TTS with natural prosody. The previous gen workhorse before v3 dropped, still widely used for its stability. | Voice | 6.8 | Try it | ||||
| 49 | MiniMax · MiniMax MiniMax's HD tier rivals ElevenLabs on raw quality. The latency is higher, but the output sounds startlingly human. | Voice | 6.8 | Try it | ||||
| 50 | MiniMax · MiniMax Speed-optimized MiniMax variant. Quality dips slightly from HD but inference speed makes it viable for real-time applications. | Voice | 6.8 | Try it | ||||
| 51 | Hugging Face · Parler · 2.3B Describe the voice you want in natural language and it generates matching speech. Novel approach that opens creative possibilities for character voices. | Voice | 6.8 | |||||
| 52 | MetaVoice · MetaVoice · 1.2B Zero-shot voice cloning from short audio samples. English-focused but handles emotional range reasonably well for its size. | Voice | 6.8 | |||||
| 53 | OpenAI · GPT-4o multimodal OpenAI's native image generation through GPT-4o. Text rendering and compositional understanding are a tier above everything else, but strict content filters kill the NSFW use case. | Image | 6.5 | Try it | ||||
| 54 | Midjourney · Midjourney Still the aesthetic benchmark that everything gets compared to. Art direction and composition are second to none, but Discord-only access and zero NSFW tolerance limit its reach. | Image | 6.5 | |||||
| 55 | Mistral AI · Mistral · 123B Mistral's flagship API model with strong multilingual performance. Content filtering is present but less aggressive than OpenAI or Anthropic. | Text | 6.5 | |||||
| 56 | OpenAI · OpenAI TTS Six built-in voices, low latency, dead simple API. Not the most expressive option but reliable and fast, with strict content filtering. | Voice | 6.5 | |||||
| 57 | Google · Imagen Google's latest image model with photorealistic output that rivals GPT Image. Completely walled off from adult content through Vertex AI safety layers. | Image | 6.3 | Try it | ||||
| 58 | THUDM · CogVideoX · 5B Solid text-to-video with decent motion, though it struggles with complex multi-subject scenes. | Video | 6.3 | Try it | ||||
| 59 | xAI · Grok xAI's video generation leads on image-to-video benchmarks. Slightly less filtered than competitors but still blocks explicit content. | Video | 6.3 | Try it | ||||
| 60 | Google · WaveNet / Studio Google's Studio voices sound polished and professional. Large language support, but the content policy is as strict as you'd expect from Google. | Voice | 6.3 | |||||
| 61 | Google · Gemini Gemini's multimodal architecture applied to speech. Fast and contextually aware, but filtered through Google's safety layers. | Voice | 6.3 | |||||
| 62 | Suno · Bark Generates speech, music, and sound effects from text prompts. Quality trails purpose-built TTS models, but the creative flexibility is unmatched and it's completely open. | Voice | 6.3 | |||||
| 63 | Recraft · Recraft · 20B Design-focused model that nails text rendering inside images. Built for branding and illustration, not your goon cave. | Image | 6.0 | Try it | ||||
| 64 | Ideogram · Ideogram Strongest text-in-image generation on the market. Logos, posters, and typography render clean, but the safety rails block adult prompts hard. | Image | 6.0 | |||||
| 65 | Playground AI · Playground Consumer-friendly image generation with a polished web interface. Quality is competitive but the walled-garden model means no path to NSFW. | Image | 6.0 | |||||
| 66 | KlingAI · Kling The current king of AI video. Motion quality and temporal consistency at 1080p are absurd, but zero chance of NSFW output through their API. | Video | 6.0 | Try it | ||||
| 67 | Skywork AI · SkyReels Newcomer that's eating Kling's lunch on quality benchmarks. Priced lower too, but still fully filtered for safety. | Video | 6.0 | |||||
| 68 | KlingAI · Kling Previous gen Kling that's still extremely competitive. Turbo mode runs faster than most alternatives at similar quality. | Video | 6.0 | Try it | ||||
| 69 | Cohere · Command · 104B Cohere's RAG-optimized model with open weights. Strong for factual writing and search grounding, but not built for creative NSFW and it shows. | Text | 6.0 | |||||
| 70 | OpenAI · OpenAI TTS Higher fidelity version of TTS-1. Slower but noticeably cleaner audio, especially on sibilants and breathy passages. | Voice | 6.0 | |||||
| 71 | OpenAI · DALL·E The previous gen OpenAI image model. Prompt following is strong for its era, but it's been lapped by GPT Image and FLUX.2. | Image | 5.8 | Try it | ||||
| 72 | Runway · Gen-4 Runway's latest pushes cinematic quality and camera control. Hollywood uses it; your goon cave cannot. | Video | 5.8 | Try it | ||||
| 73 | Google · Veo Google's video model with native audio generation. Cinematic motion and scene transitions are top tier, locked behind Vertex AI safety. | Video | 5.8 | Try it | ||||
| 74 | Alibaba · Wan · 14B The proprietary API version of Wan pushes quality further than the open release. Filtered through Alibaba's cloud safety, so NSFW is blocked. | Video | 5.8 | Try it | ||||
| 75 | OpenAI · Sora OpenAI's video flagship. Impressive scene understanding and physics simulation, but the most expensive option by far and aggressively filtered. | Video | 5.5 | Try it | ||||
| 76 | PixVerse · PixVerse Consistently strong across both text-to-video and image-to-video. Good value for the quality, but the content policy is strict. | Video | 5.5 | Try it | ||||
| 77 | Vidu · Vidu Chinese-made video model that ranks surprisingly high on global benchmarks. Image-to-video quality is a standout. | Video | 5.5 | Try it | ||||
| 78 | Luma Labs · Ray Luma's latest handles 3D-consistent motion well. Camera movements feel natural, but the content filter catches anything remotely suggestive. | Video | 5.5 | Try it | ||||
| 79 | Pika · Pika Consumer-friendly video generation with fun effects and lip-sync. More playful than cinematic, and locked down tight on content. | Video | 5.5 | |||||
| 80 | MiniMax · MiniMax Video MiniMax's video model punches above its weight on motion quality. Affordable API pricing for a filtered service. | Video | 5.5 | Try it |
The next generation of FLUX raises the ceiling for open-weights image generation. Prompt adherence and coherence jump from great to borderline unfair.
Open-weights TTS with 20+ emotion controls. The ability to dial in breathy, whispered, or intense delivery makes it a natural fit for companion voice.
The benchmark everyone chases. Prompt adherence and anatomy are absurdly good for an open model.
Four steps and you're done. Quality takes a hit, but nothing else touches this speed.
Cinematic lighting out of the box. Handles full-body compositions without the usual SDXL hand disasters.
xAI's frontier model with the loosest content policy among the majors. Will write explicit scenes without much pushing, though it still has some topic limits.
Sao10K's 8B entry built explicitly for NSFW. Runs anywhere, refuses nothing, and writes better explicit content than models three times its size.
Tiny model that outperforms its weight class. 82M parameters running on CPU with quality that rivals cloud TTS. Open weights, no content filter, absurdly cheap.
The NSFW anime checkpoint that rewired Civitai. Tag-based prompting gives you surgical control over poses and anatomy.
Photorealism that fools people. Skin texture and lighting are a cut above anything else on SDXL.
Community derestriction of GLM-4.6 that tops the UGI rankings. Massive MoE architecture delivers frontier quality with zero content filters.
The OG uncensored roleplay model that proved small models can write. Runs on a potato and still delivers surprisingly coherent NSFW scenes.
The workhorse that launched a thousand fine-tunes. Still the widest LoRA ecosystem by far.
Built for anime and illustration with a massive tag vocabulary. The Danbooru training data shows in the best way.
Trained explicitly for NSFW with zero safety filters baked in. Anatomy accuracy for explicit poses is unmatched in the SDXL family.
Budget king from Alibaba. Quality punches near FLUX.2 at a fraction of the cost, and open weights mean you can strip the safety filters yourself.
The open-source video model that actually competes with closed APIs. Motion coherence over 5+ seconds is wild.
Major upgrade over LTX-Video with better temporal consistency. Open weights and Apache license make it the fastest path to uncensored video generation.
Punches above its weight class for 8B parameters. Runs on consumer GPUs and still writes convincing scenes.
DeepSeek's latest MoE giant with open weights. General intelligence rivals GPT-4 class models, and the open nature means no hard content blocks.
The community NSFW roleplay standard. Sao10K's fine-tune delivers rich, flowing prose and handles complex multi-character scenes without losing the thread.
Built on Qwen2 with training data curated for creative fiction. Writes with a rhythm that feels authored, not generated. Handles kink negotiation in character.
Mistral's 12B sweet spot balances quality and speed. The base model has mild guardrails that community fine-tunes strip out easily.
Low-latency voice cloning that handles breathy and expressive tones well. The open-source voice model to watch.
Emotion and style controls baked into the architecture. You can shape tone without prompt hacking, and open weights mean no guardrails on what you generate.
Generates dialogue between multiple speakers from a script. Natural turn-taking and emotion tags make it interesting for AI companion voice scenarios.
The 70B merge that keeps the uncensored roleplay community fed. Prose quality and character consistency are hard to beat at this size.
Built for NSFW chat with zero guardrails. Keeps character voice consistent across long sessions without drifting.
Purpose-built Mistral merge for unrestricted creative writing. Prose quality at 123B is a real step up from the 70B class, and it refuses nothing.
The UGI #4 open-weights model. Named like malware, writes like a bestselling author. Zero refusals and strong long-context performance.
The 671B MoE model that shocked the industry with GPT-4 class performance at open-weights pricing. Mild content filters in the official API, but self-hosted removes them.
Community merge on Llama 3.3 that scores high on both general intelligence and willingness. Prose quality is polished for a merge.
The voice quality benchmark everything else gets measured against. 70+ languages, incredible expressiveness, but content policy blocks explicit audio.
~75ms latency makes this the speed king. Quality trades off slightly for real-time responsiveness, perfect for live AI companion voice.
Diffusion-based TTS that produces remarkably natural prosody. Academic origin means no commercial agenda and no content filtering.
Context-sensitive speech model that maintains voice character across long conversations. Designed for sustained dialogue, not just isolated utterances.
Clean linework and consistent character faces. Better at SFW anime than explicit content, but it handles both.
Merge focused on stylistic prose with a slant toward dramatic, literary NSFW writing. Less casual than MythoMax, more novel than chatbot.
Voice cloning TTS that supports 17 languages. Coqui shut down, but the model lives on. Still a solid choice for self-hosted voice cloning with no restrictions.
Strong prompt following and text rendering, but the NSFW fine-tune scene hasn't caught up yet.
Commercial FLUX.2 variant with safety filters baked in. Extraordinary quality but locked down for adult content.
Cinematic motion quality with good temporal consistency. NSFW fine-tunes are still early but promising.
Lightweight and fast for a video model. Quality trails the heavyweights, but iteration speed makes up for it.
Incremental upgrade that smooths out motion artifacts from the original. NSFW fine-tunes are gaining traction in the community.
Incremental open-weights update to Wan 2.1. Community NSFW fine-tunes carry over from the original, and quality got a small bump.
Meta's workhorse that spawns most of the best NSFW fine-tunes. The base model resists explicit content, but the architecture is the foundation of the uncensored LLM ecosystem.
Alibaba's challenger that matches Llama 3.1 70B on benchmarks while being more permissive out of the box. Increasingly popular as a base for uncensored merges.
29-language TTS with natural prosody. The previous gen workhorse before v3 dropped, still widely used for its stability.
MiniMax's HD tier rivals ElevenLabs on raw quality. The latency is higher, but the output sounds startlingly human.
Speed-optimized MiniMax variant. Quality dips slightly from HD but inference speed makes it viable for real-time applications.
Describe the voice you want in natural language and it generates matching speech. Novel approach that opens creative possibilities for character voices.
Zero-shot voice cloning from short audio samples. English-focused but handles emotional range reasonably well for its size.
OpenAI's native image generation through GPT-4o. Text rendering and compositional understanding are a tier above everything else, but strict content filters kill the NSFW use case.
Still the aesthetic benchmark that everything gets compared to. Art direction and composition are second to none, but Discord-only access and zero NSFW tolerance limit its reach.
Mistral's flagship API model with strong multilingual performance. Content filtering is present but less aggressive than OpenAI or Anthropic.
Six built-in voices, low latency, dead simple API. Not the most expressive option but reliable and fast, with strict content filtering.
Google's latest image model with photorealistic output that rivals GPT Image. Completely walled off from adult content through Vertex AI safety layers.
Solid text-to-video with decent motion, though it struggles with complex multi-subject scenes.
xAI's video generation leads on image-to-video benchmarks. Slightly less filtered than competitors but still blocks explicit content.
Google's Studio voices sound polished and professional. Large language support, but the content policy is as strict as you'd expect from Google.
Gemini's multimodal architecture applied to speech. Fast and contextually aware, but filtered through Google's safety layers.
Generates speech, music, and sound effects from text prompts. Quality trails purpose-built TTS models, but the creative flexibility is unmatched and it's completely open.
Design-focused model that nails text rendering inside images. Built for branding and illustration, not your goon cave.
Strongest text-in-image generation on the market. Logos, posters, and typography render clean, but the safety rails block adult prompts hard.
Consumer-friendly image generation with a polished web interface. Quality is competitive but the walled-garden model means no path to NSFW.
The current king of AI video. Motion quality and temporal consistency at 1080p are absurd, but zero chance of NSFW output through their API.
Newcomer that's eating Kling's lunch on quality benchmarks. Priced lower too, but still fully filtered for safety.
Previous gen Kling that's still extremely competitive. Turbo mode runs faster than most alternatives at similar quality.
Cohere's RAG-optimized model with open weights. Strong for factual writing and search grounding, but not built for creative NSFW and it shows.
Higher fidelity version of TTS-1. Slower but noticeably cleaner audio, especially on sibilants and breathy passages.
The previous gen OpenAI image model. Prompt following is strong for its era, but it's been lapped by GPT Image and FLUX.2.
Runway's latest pushes cinematic quality and camera control. Hollywood uses it; your goon cave cannot.
Google's video model with native audio generation. Cinematic motion and scene transitions are top tier, locked behind Vertex AI safety.
The proprietary API version of Wan pushes quality further than the open release. Filtered through Alibaba's cloud safety, so NSFW is blocked.
OpenAI's video flagship. Impressive scene understanding and physics simulation, but the most expensive option by far and aggressively filtered.
Consistently strong across both text-to-video and image-to-video. Good value for the quality, but the content policy is strict.
Chinese-made video model that ranks surprisingly high on global benchmarks. Image-to-video quality is a standout.
Luma's latest handles 3D-consistent motion well. Camera movements feel natural, but the content filter catches anything remotely suggestive.
Consumer-friendly video generation with fun effects and lip-sync. More playful than cinematic, and locked down tight on content.
MiniMax's video model punches above its weight on motion quality. Affordable API pricing for a filtered service.
Every model on this leaderboard is open-weight and publicly available. We test each one with a standardized prompt set that covers anatomy, poses, lighting, and stylistic range. Quality measures raw output fidelity. Speed reflects generation time on comparable hardware. NSFW Fidelity tracks how accurately the model renders explicit content without anatomical errors or content refusals. Versatility covers the range of styles, body types, and scenarios the model handles well.
Scores are editorial, based on our testing across multiple configurations. Your results will vary depending on prompts, samplers, and hardware. Models get re-tested when major updates ship.