Secrets AI Video Generator: How It Works, Quality, and Cost

Video generation is the feature that separates Secrets AI from most competitors in the AI companion market. While platforms like Character.AI, CrushOn AI, and Janitor AI offer no video capability at all, Secrets AI generates short motion clips from companion images via a text prompt. Whether this feature justifies the platform's pricing depends on how you plan to use it and how much Moments budget you can allocate to it.

The Competitive Context: Why This Feature Matters

In the mainstream AI companion space, video generation is rare. The majority of platforms offer static image generation at best. The ability to animate a companion — to see movement, expression, and motion rather than a still frame — represents a genuinely different product category.

How the main competitors compare with Secrets AI on this feature:

  • Character.AI: No video generation
  • CrushOn AI: No video generation
  • Janitor AI: No video generation
  • Candy AI: Limited video available, but not as the central feature it is on Secrets AI
  • Replika: No video generation

The only platforms with comparable video capabilities are smaller or more niche: SweetDream AI and Xotic AI (which offers 4K 15-second clips). In the mainstream market, Secrets AI's video generation is a genuine differentiator — not a marginal feature advantage.

For users whose interest in AI companions is primarily visual rather than conversational, this feature changes the value calculation significantly. For users who are primarily text-chat focused, it is a secondary consideration with meaningful Moments costs.

How the Video Generator Works

The process has four steps:

Step 1 — Generate or select a source image

Video generation starts from an existing companion image. Either use one of the four auto-generated starter images from character creation or generate a new one with a specific pose or setting (25-50 Moments). Higher-quality source images produce better video outputs — this initial investment pays off in clip quality.

Step 2 — Write a motion prompt

Describe the desired movement or action in a text field. Effective prompts are specific but not overly complex: "walking slowly toward camera with a smile," "reaching out and gesturing," or "head tilt with soft laugh." Highly detailed or contradictory prompts occasionally produce unexpected results. Start simple, especially on first attempts.

Step 3 — Submit and wait

Processing takes approximately 2 minutes. The AI interprets the prompt in the context of the source image and generates a motion sequence. You cannot adjust the clip mid-generation — if the output is not what you wanted, you will need to regenerate with a revised prompt.

Step 4 — Review and save

The completed clip can be viewed directly in the platform and saved. Context-awareness is maintained — the character's appearance, clothing, and setting from the source image carry into the video output.

Quality Assessment

Independent reviewers rate Secrets AI video quality at 4.1/5 — described as "videos look good and move smoothly most of the time." In practice:

  • Character consistency: The companion's appearance, hair, and clothing from the source image transfer accurately to the video in most outputs
  • Facial expressions: Natural and responsive to the prompt context — smiling, laughing, or neutral expressions render without the uncanny stiffness common in early AI video generation
  • Motion fluidity: Movement is generally smooth; complex motions (hands gesturing, full-body movement) occasionally show minor artifacts
  • Prompt interpretation: Simple, specific prompts produce the most reliable results; abstract or highly complex prompts introduce inconsistency

Quality improves on the Premium and Advanced generation models (available on Premium and Ultimate tiers). Standard model clips are functional but noticeably less refined than Premium model outputs.

One benchmark to set expectations: these are short companion clips, not cinematic AI video. The quality is appropriate for personal companion content — natural motion, good character fidelity, realistic expressions — rather than production-level output.

What Videos Cost in Moments

The Moments cost is the primary factor governing how often most users generate video. The cost range is wide:

Clip Type          | Moments Cost | Monthly Budget on Plus | Monthly Budget on Premium
Short clip (3 sec) | ~50 Moments  | ~60 clips              | ~160 clips
Full/longer clip   | ~600 Moments | ~5 clips               | ~13 clips

Short clips at ~50 Moments are accessible even at the Plus tier (3,000 Moments/month). Full-length clips at ~600 Moments each are expensive relative to monthly allocations — Premium (8,000 Moments) provides around 13 per month if dedicated entirely to video.
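The per-tier clip counts can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming the approximate costs quoted in this article (about 50 Moments per short clip, about 600 per full clip) and that the whole monthly allocation goes to video. The names `TIER_MOMENTS`, `CLIP_COST`, and `clips_per_month` are illustrative and not part of any Secrets AI tool.

```python
# Monthly Moments allocations and approximate clip costs quoted in this article.
# Clip costs are estimates ("~50", "~600"), so the results are estimates too.
TIER_MOMENTS = {"Plus": 3_000, "Premium": 8_000, "Ultimate": 15_000}
CLIP_COST = {"short": 50, "full": 600}


def clips_per_month(tier: str, clip_type: str) -> int:
    """Whole clips affordable if the entire allocation goes to video."""
    return TIER_MOMENTS[tier] // CLIP_COST[clip_type]


for tier in TIER_MOMENTS:
    short = clips_per_month(tier, "short")
    full = clips_per_month(tier, "full")
    print(f"{tier}: {short} short clips or {full} full clips")
```

Integer division is used because a partial clip cannot be generated, which is why 8,000 Moments yields 13 full clips rather than 13.3.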

Real allocation math for a video-focused user on Premium (8,000 Moments):

If you generate 10 full clips (6,000 Moments), you have 2,000 Moments remaining for images (~40-80 images) and text messaging (negligible cost per message). This is a workable budget for a user who treats video as the primary engagement mode.

If you mix 5 full clips (3,000 Moments) with 100 images (2,500-5,000 Moments), voice calls (1,000-2,000 Moments), and text messaging, the total comes to roughly 6,500-10,000 Moments. That puts you near, and possibly over, the 8,000-Moment Premium allocation.
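The mixed plan above can be tallied as a low and high estimate. This is a rough sketch using only the ranges quoted in this article; text messaging is left out because its per-message cost is small, and the `mixed_plan` dictionary is just an illustrative way to hold the numbers.

```python
PREMIUM_ALLOCATION = 8_000  # Moments per month on Premium, per this article

# (low, high) Moments estimates for each part of the mixed plan.
mixed_plan = {
    "5 full clips": (5 * 600, 5 * 600),
    "100 images": (100 * 25, 100 * 50),
    "voice calls": (1_000, 2_000),
}

low = sum(lo for lo, _ in mixed_plan.values())
high = sum(hi for _, hi in mixed_plan.values())

print(f"Mixed plan total: {low:,}-{high:,} Moments")
print(f"Headroom at the low estimate: {PREMIUM_ALLOCATION - low:,}")
print(f"Shortfall at the high estimate: {max(0, high - PREMIUM_ALLOCATION):,}")
```

At the low end the plan leaves 1,500 Moments spare; at the high end it overshoots the Premium allocation by 2,000, so image cost per generation is the deciding variable.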

The honest conclusion: heavy video users (10+ full clips per month) who also want regular images and voice calls need Ultimate ($39.99) or should plan to purchase Moments top-ups. Moderate video users (3-5 clips per month) can sustain that on Premium. Casual video users (1-2 clips per month) can manage on Plus.

Top-up Moments bundles start at $5.99 for 1,980 Moments — a practical way to supplement monthly allocations without upgrading to a higher tier permanently.
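To put the top-up price in per-clip terms, the sketch below converts Moments to dollars at the rate of the smallest bundle. It assumes only the $5.99 for 1,980 Moments figure quoted here; larger bundles may be priced differently, and bundles are bought whole, so treat the per-clip figures as approximate equivalents. The helper `top_up_cost` is a hypothetical name.

```python
TOP_UP_PRICE_USD = 5.99  # smallest bundle quoted in this article
TOP_UP_MOMENTS = 1_980

usd_per_moment = TOP_UP_PRICE_USD / TOP_UP_MOMENTS


def top_up_cost(moments: int) -> float:
    """Approximate USD value of `moments` at the smallest bundle's rate."""
    return round(moments * usd_per_moment, 2)


print(f"Full clip (~600 Moments): about ${top_up_cost(600):.2f}")
print(f"Short clip (~50 Moments): about ${top_up_cost(50):.2f}")
print(f"Full clips per bundle: {TOP_UP_MOMENTS // 600}")
```

At this rate one bundle covers three full clips, at roughly $1.82 each, with 180 Moments left over.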

Video vs Images vs Voice: Cost Comparison

Understanding the relative cost of each media type clarifies how to allocate Moments efficiently:

Feature             | Cost (Moments) | Output
Text message        | 1-2            | Text response
Image (standard)    | 25-50          | Single static image
Short video (3 sec) | ~50            | Brief motion clip
Full video clip     | ~600           | Extended motion clip
Voice call          | 100/minute     | Real-time audio

For the cost of one full video clip (600 Moments), you could alternatively generate 12-24 images or have 6 minutes of voice conversation. This tradeoff makes video a deliberate choice rather than casual usage: a full clip costs roughly 12-24x as much per output as a standard image.
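The equivalents quoted above follow directly from the cost table. A minimal sketch, assuming the approximate figures from this article (600 Moments per full clip, 25-50 per image, 100 per voice minute); the constant names are illustrative.

```python
FULL_CLIP = 600          # approximate Moments per full clip
IMAGE_COST = (25, 50)    # (low, high) Moments per standard image
VOICE_PER_MINUTE = 100   # Moments per minute of voice call

images_min = FULL_CLIP // IMAGE_COST[1]  # image count at the high image price
images_max = FULL_CLIP // IMAGE_COST[0]  # image count at the low image price
voice_minutes = FULL_CLIP // VOICE_PER_MINUTE

print(f"One full clip = {images_min}-{images_max} images")
print(f"One full clip = {voice_minutes} minutes of voice")
```

Changing `FULL_CLIP` to 50 gives the same comparison for a short clip, which works out to only one or two images.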

The strategic approach most experienced users adopt: generate images first to establish the visual setup you want, then convert the best image to video. This preserves Moments while ensuring the video starts from a high-quality source.

Who Gets Value from Video Generation

Worth prioritizing if:

  • Visual companion content is as important as conversation to you
  • You want unique media output from your companion rather than just chat
  • You are on Premium or Ultimate and have Moments to allocate after conversation and image needs
  • You save and review generated content as part of your usage pattern

Lower priority if:

  • Your primary interaction is text-based conversation
  • You are on Plus or Lite and need to conserve Moments for images and messaging
  • The 2-minute generation wait interrupts your preferred interaction flow
  • Budget constraints make the 600 Moments-per-clip cost prohibitive relative to the output

Best tier for video: Ultimate ($39.99) for heavy use, where 15,000 Moments provides ~25 full-length clips per month. Premium ($19.99) for moderate use, where 8,000 Moments covers up to ~13 full clips if spent entirely on video, and fewer alongside other usage. Plus ($9.99) for occasional use, where 3,000 Moments covers 5 full clips but leaves nothing for other media.
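The tier recommendation can be expressed as a small lookup: find the cheapest tier whose allocation covers the planned full clips plus everything else. This is a sketch under the prices and allocations quoted in this article. Lite is omitted because full clips are described as requiring Plus or above, and `cheapest_tier` is a hypothetical helper, not a platform feature.

```python
from typing import Optional

# (tier, monthly price in USD, monthly Moments), cheapest first.
# Lite is omitted: this article says full clips need Plus or above.
TIERS = [
    ("Plus", 9.99, 3_000),
    ("Premium", 19.99, 8_000),
    ("Ultimate", 39.99, 15_000),
]
FULL_CLIP_COST = 600  # approximate Moments per full clip


def cheapest_tier(full_clips: int, other_moments: int = 0) -> Optional[str]:
    """Return the cheapest tier whose allocation covers the planned usage."""
    needed = full_clips * FULL_CLIP_COST + other_moments
    for name, _price, moments in TIERS:
        if needed <= moments:
            return name
    return None  # exceeds Ultimate; top-ups would be needed


print(cheapest_tier(2, other_moments=1_000))   # 2,200 Moments needed
print(cheapest_tier(5, other_moments=2_000))   # 5,000 Moments needed
print(cheapest_tier(13, other_moments=2_000))  # 9,800 Moments needed
print(cheapest_tier(30))                       # 18,000 Moments needed
```

The `other_moments` argument is where images, voice, and messaging go; leaving it at zero reproduces the video-only figures in the paragraph above.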

Getting Better Results

Practical guidance that affects output quality:

  • Start with a clean, well-lit source image. The video inherits the quality of its starting frame.
  • Keep motion prompts short, ideally well under 15 words, for the most predictable interpretation. "Smiling and reaching out, close shot" outperforms "smiling warmly and slowly extending her hand toward the camera in a welcoming gesture."
  • Test with short clips (50 Moments) before committing to full clips (600 Moments) on a new prompt style.
  • Use Premium generation models where available — the quality difference is meaningful in video output.
  • Generate during off-peak hours if you want faster processing times. The stated average is ~2 minutes, but peak hour loads can extend this.

For context on how video generation fits within the broader platform, see the full features overview. For Moments pricing and how video costs compare to the subscription tiers, the pricing guide includes the complete Moments cost table. See the free vs premium breakdown for how video access differs across subscription tiers.

FAQ

How long are Secrets AI video clips?

Video clip length varies by generation type and tier. Short clips are approximately 3 seconds (available from Lite tier upward at ~50 Moments each). Longer/full-length clips are available on Plus and above, costing up to 600 Moments per clip. The platform does not publish specific maximum clip lengths for full clips; reviewers describe them as "longer motion clips" with meaningful motion sequences rather than brief loops.

Can you generate videos on the free tier?

No. Video generation requires at least a Lite subscription ($5.99/month). The free tier does not include video access, so the 200 starting Moments cannot be spent on video regardless of balance. Upgrading to Lite unlocks 3-second short clips; Plus unlocks full video generation.

How many videos can you generate per month?

It depends on your subscription tier and whether you generate short or full-length clips. On Plus (3,000 Moments): approximately 60 short clips or 5 full clips per month. On Premium (8,000 Moments): approximately 160 short clips or 13 full clips. On Ultimate (15,000 Moments): approximately 300 short clips or 25 full clips. These figures assume Moments are dedicated entirely to video; mixed usage with images and voice reduces the available video budget.

How good is the video quality?

Video quality is rated 4.1/5 by independent reviewers, described as "videos look good and move smoothly most of the time." Character appearance from the source image transfers accurately into motion. Facial expressions and basic movement patterns render naturally in most outputs. Complex motions (detailed hand gestures, multi-step actions) occasionally introduce minor artifacts. Quality improves on the Premium and Advanced generation models available to Premium and Ultimate subscribers.
