Floom audio — "open source" noise fix (v7 → v8)

The tail "and open source" in the raw take carried a measurable hiss bed at about -40 dB. In mix_v8 only "open source" is replaced with the clean ElevenLabs voice clone; his real "We are fully model-agnostic and" is kept, and the splice sits in the silent micro-gap between his "and" and "open". The quiet-region floor drops from about -40 dB to about -52 to -55 dB (clone reference ≈ -49.7 dB). Total L12 timing and every film beat are unchanged (stem length identical, 63.2924 s). A/B below.
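
For anyone re-checking the floor figures, here is a minimal Python sketch of how a quiet-region level could be measured. It assumes float samples normalized to ±1.0 and an RMS reading in dB relative to full scale. The report does not say which meter produced the numbers above, so the function name and the windowing are illustrative only.

```python
import math

def rms_dbfs(samples, sample_rate, start_s, end_s):
    """RMS level, in dB relative to full scale (1.0), of samples in [start_s, end_s)."""
    lo = int(round(start_s * sample_rate))
    hi = int(round(end_s * sample_rate))
    window = samples[lo:hi]
    if not window:
        raise ValueError("empty measurement window")
    mean_square = sum(x * x for x in window) / len(window)
    if mean_square == 0.0:
        return float("-inf")  # digital silence has no finite dB value
    return 10.0 * math.log10(mean_square)

# Synthetic check (not project audio): a constant 0.01 signal sits at -40 dBFS.
demo = [0.01] * 4800
print(round(rms_dbfs(demo, 48000, 0.0, 0.04), 1))  # -40.0
```

Run on the real stem, the window would be the 52.60-52.64 s gap; a drop of about 12 dB corresponds to the RMS amplitude falling to roughly a quarter.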

New · mix_v8 (clone patch)

mix_v8 — clean "open source"

69.3 s · -14.1 LUFS · -1.0 dBTP (unchanged target). Splice: partA = his real VO to 52.60 s ("…agnostic and") → clone "open source" (gentle 1.12× stretch, +1 dB to level-match) landing "open"≈52.64 s, "source" end≈53.56 s → resume his real VO at 53.62 s ("because for us it's about one thing…"). Seam is silence→silence at both joins (his gap -53 dB; clone floor -49.7 dB).
region                                 | mix_v7 (raw) | mix_v8 (clone)
stem gap "and"→"open" (52.60-52.64 s)  | -39.9 dB     | -52.9 dB
stem gap after "source" (≈53.6 s)      | -25 dB*      | -55.6 dB
*v7's post-"source" region also carried his overlapping "for us" onset plus hiss; in v8 the seam there is clean clone-floor silence. The phrase "open source" was verified intact: it transcribes cleanly with line context and is aligned to the caption window.
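
The splice arithmetic can be sanity-checked with a short sketch. The times, stretch factor and gain are taken from the splice note above. Reading "1.12× stretch" as lengthening the clone's duration by a factor of 1.12 is an assumption, and all function and field names are hypothetical.

```python
STEM_LENGTH_S = 63.2924   # stem length, stated as identical in v7 and v8
CUT_OUT_S = 52.60         # real VO kept up to here ("...agnostic and")
RESUME_S = 53.62          # real VO resumes here
CLONE_START_S = 52.64     # clone "open" onset
CLONE_END_S = 53.56       # clone "source" end
STRETCH = 1.12            # assumed: output duration = input duration * 1.12
GAIN_DB = 1.0             # level-match gain applied to the clone

def db_to_linear(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)

def splice_plan():
    slot = RESUME_S - CUT_OUT_S              # hole left in the real VO
    placed = CLONE_END_S - CLONE_START_S     # clone duration after stretching
    lead_pad = CLONE_START_S - CUT_OUT_S     # silence before "open"
    tail_pad = RESUME_S - CLONE_END_S        # silence after "source"
    new_length = CUT_OUT_S + lead_pad + placed + tail_pad + (STEM_LENGTH_S - RESUME_S)
    return {
        "slot_s": slot,
        "clone_source_s": placed / STRETCH,  # clone duration before stretching
        "lead_pad_s": lead_pad,
        "tail_pad_s": tail_pad,
        "gain_linear": db_to_linear(GAIN_DB),
        "new_length_s": new_length,
    }

for key, value in splice_plan().items():
    print(f"{key}: {value:.4f}")
```

Because the padded clone fills the 1.02 s slot exactly, the patched stem comes out at the original 63.2924 s, which is why no downstream timing moves.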
Previous · mix_v7 (raw take)

mix_v7 — raw "open source" (noisy tail)

69.3 s · -14.1 LUFS · -1.0 dBTP. Same mix recipe as v8, but the VO stem is the unmodified take: "…and open source" rides over the ~-40 dB hiss bed described above. Front of the line ("We are fully model-agnostic") is byte-identical between v7 and v8.

Earlier context (separate "source" clipping check) below.

Reference · mix_v7

mix_v7 — "source" is intact here

69.3 s · -14.1 LUFS · -1.0 dBTP. The line reads "…and open source for us. It's about one thing." in the real take (v4_voice_clean.wav). "source" runs ~52.9-53.1 s, flows straight into "for us", and decays naturally to the room floor by ~53.6 s. No clip, no near-0-dBFS chop. Every audio path that reaches the delivered film (mix_v7, the ElevenLabs el_mixA/B, and the Remotion full-film comp) uses this same un-clipped take.
Recovered · real voice

"…and open source" — pulled from the uncut original

Federico's real voice from transcripts/DSCF0275.wav (the take that runs "…and open source. Because for us it's…"). Extracted 282.00-284.30s, so the full "source" with its natural decay tail is present. Not synthesized, not a clone.
window (s)   | peak (dB) | what
283.8-284.0  | 0.0       | loud body of "source"
284.0-284.2  | -10.0     | natural decay of the word
284.2-284.3  | -30.4     | settled into room tone
Recovered-clip last 0.2s: peak -14.5 dB, RMS -29.7 dB — a genuine amplitude decay, the opposite of the clipped segment.
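
The decay-versus-chop distinction can be expressed as a small check on the end of a clip. This is a sketch under assumptions: float samples normalized to ±1.0, and a hypothetical -3 dB tail-peak threshold that is not a value used in the project.

```python
import math

def to_db(value):
    """Amplitude to dB re full scale; non-positive input maps to -inf."""
    return float("-inf") if value <= 0.0 else 20.0 * math.log10(value)

def tail_levels(samples, sample_rate, tail_s=0.2):
    """Return (peak_db, rms_db) over the last tail_s seconds of the clip."""
    n = max(1, int(round(tail_s * sample_rate)))
    tail = samples[-n:]
    peak = max(abs(x) for x in tail)
    rms = math.sqrt(sum(x * x for x in tail) / len(tail))
    return to_db(peak), to_db(rms)

def looks_chopped(samples, sample_rate, tail_s=0.2, peak_threshold_db=-3.0):
    """True when the clip is still near full level in its final tail_s seconds."""
    peak_db, _ = tail_levels(samples, sample_rate, tail_s)
    return peak_db > peak_threshold_db

def tone(sample_rate, seconds, decay_per_s):
    """Synthetic 200 Hz tone with an exponential envelope (decay_per_s=0 means none)."""
    count = int(sample_rate * seconds)
    return [
        0.95 * math.exp(-decay_per_s * i / sample_rate)
        * math.sin(2.0 * math.pi * 200.0 * i / sample_rate)
        for i in range(count)
    ]

sr = 8000
print(looks_chopped(tone(sr, 1.0, 8.0), sr))  # decayed tail
print(looks_chopped(tone(sr, 1.0, 0.0), sr))  # full level at the file boundary
```

On the recovered clip, the reported tail peak of -14.5 dB would fall on the decayed side of such a threshold.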
Where the clip actually is · film segment

seg_12_L12.mp4 — clipped, but not in scope

The per-line intermediate v4/segs/seg_12_L12.mp4 ends "…and open source." right on the file boundary: its "source" peaks at -0.05 dB at 7.80s and the file ends at 7.93s with no decay — that's the chopped word the brief describes. It's a film comp (told not to touch), and it is not the audio source for the final film. So it does not affect what anyone hears.
The call: the mix does not need a "source" splice — the word is already whole in every audio stem that ships. So there's no honest mix_v9 to build; re-injecting "source" would only risk doubling the word or swapping the good 48 kHz take for a 16 kHz insert. If you still want seg_12 itself re-cut with the full decayed "source" (a film-comp fix), say so and I'll extend that segment using the recovered tail — but that's video-side work, outside "audio-only".