Create a square 1:1 ultra-realistic cinematic comedy video, 10 seconds, set at a sunny upscale sidewalk cafe in front of a reflective glass storefront.

STYLE: Ultra-realistic live-action cinematic look, fashion-commercial quality, crisp daylight, shallow depth of field, polished camera movement, realistic reflections, realistic glass physics, realistic espresso liquid motion, comedic timing, no horror or danger tone.

MAIN CHARACTER: A fashionable woman in ornate Western-inspired clothing, consistent identity throughout the entire video. She has styled brown hair, elegant makeup, expressive eyes, large decorative earrings, a red neck scarf, a dark blue embroidered Western-style jacket and matching skirt with fringe, and black thigh-high boots. She is glamorous, confident, amused, and composed.

SECOND CHARACTER: A white humanoid robot waiter behind the glass storefront, consistent design throughout. Smooth white body, black visor-like eyes, black bow tie, white waiter uniform/apron, holding a small tray with an espresso cup and saucer. The robot should feel awkward and polite, not threatening.

SETTING: Sunny upscale European-style sidewalk cafe. Round brass cafe table in the foreground, wicker cafe chair, reflective glass storefront behind the woman, elegant interior lights visible through the glass, city street reflections on the window, warm daylight, crisp shadows, stylish luxury atmosphere.

VIDEO SEQUENCE:

SHOT 1 — SETUP, 0.00–1.80s
Medium-wide profile shot. The woman sits side-on beside the round brass cafe table, legs crossed, calm and glamorous. Behind the reflective glass storefront, the white robot waiter stands holding an espresso cup and saucer on a tray. Slow cinematic push-in. Emphasize reflective glass, sunlight, brass table highlights, and fashionable styling.

SHOT 2 — IMPACT, 1.80–3.20s
The robot politely extends the espresso cup and saucer forward, accidentally trying to serve through the glass. The cup collides with the window. A sudden circular radial burst of shattered glass spreads outward from the impact point. Espresso sprays outward in slow motion. The woman remains on the left side of frame, still composed but beginning to react. Make the impact dramatic but comedic, with no injury.

SHOT 3 — AFTERMATH, 3.20–5.46s
Push closer toward the broken glass, robot, cup, and spill. The espresso cup tips forward and dark coffee pours in a thick realistic stream onto the brass table and floor. Glass fragments fall and sparkle in the sunlight. Coffee splashes into a growing puddle. The robot freezes stiffly, awkward and embarrassed, still holding the tray. Use realistic gravity, falling debris, liquid splashes, and reflective surfaces.

SHOT 4 — REACTION CLOSE-UP, 5.46–7.55s
Clean cut to a front-facing close-up of the woman. She leans slightly toward camera with amused surprise and a charming smile, as if delivering a witty reaction line. Behind her, slightly out of focus, the white robot stands near the broken glass and coffee puddle. Maintain background continuity with shattered glass, spilled coffee, and reflective storefront.
OPTIONAL DIALOGUE: “Well... I guess the future still needs training.”

SHOT 5 — CLOSING GESTURE, 7.55–10.00s
The woman glances down at the spilled espresso and scattered glass, then raises both open palms in a playful “what can you do?” shrug. She ends with a confident amused smile. The robot remains frozen awkwardly in the background, still holding the tray. Gentle final push-in, elegant comedic ending.

CAMERA: Smooth cinematic camera movement, no shaky handheld. Start with medium-wide profile, then slow-motion impact, then close detail on liquid and glass, then clean reaction close-up, ending with a gentle push-in. Maintain square composition throughout.

PHYSICS: Glass must fracture in a believable circular spiderweb pattern. Shards fall with gravity. Espresso pours downward naturally, splashes on the brass table, then drips onto the floor. No floating liquid, no impossible cup movement, no unrealistic debris.

AUDIO: Upscale cafe ambience, soft city street atmosphere. Sharp comedic glass burst at impact, liquid splash and pour sounds, tiny glass fragments falling, subtle comic beat during robot’s awkward freeze, optional short amused dialogue from the woman, gentle final comedic sting.

NEGATIVE PROMPT: No celebrity likeness, no copyrighted character, no nudity, no gore, no injury, no horror tone, no extra people entering foreground, no duplicated robot, no warped hands, no distorted face, no melting clothing, no impossible cup geometry, no liquid floating without gravity, no random text, no subtitles, no logos, no strong camera shake, no chaotic motion blur, no broken continuity.