Gemini 2.0 Flash and Gemini 2.0 Flash Lite completely hallucinate timestamps when transcribing audio #426

Open
@bigmandevs

Description of the bug:

The GA versions of both models completely hallucinate timestamps when transcribing audio.

Actual vs expected behavior:

Timestamps should match when each word or phrase was actually spoken. The preview versions of both models were excellent at this, but since going GA the same models completely hallucinate timestamps.

Any other information you'd like to share?

Interestingly, this applies to audio only. Timestamps extracted from video are unaffected.

You can reproduce this by transcribing any audio file and comparing the timestamp accuracy against a video transcription.
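For anyone trying to reproduce this, a rough sanity check on the returned transcript can flag the most obvious hallucinations without a reference transcript. The sketch below is only an illustration: it assumes the model was prompted to prefix each line with `[MM:SS]` or `[HH:MM:SS]` (the exact format depends on your prompt), and the helper names `parse_timestamps` and `find_timestamp_problems` are hypothetical, not part of any Gemini SDK.

```python
import re

# Assumed line format: "[MM:SS] text" or "[HH:MM:SS] text".
# This is a hypothetical convention; adjust the pattern to match your prompt.
TIMESTAMP_RE = re.compile(r"^\[(?:(\d+):)?(\d{1,2}):(\d{2})\]")


def parse_timestamps(transcript: str) -> list[int]:
    """Return each line's leading timestamp in seconds, skipping lines without one."""
    seconds = []
    for line in transcript.splitlines():
        match = TIMESTAMP_RE.match(line.strip())
        if match:
            hours, minutes, secs = match.groups()
            seconds.append(int(hours or 0) * 3600 + int(minutes) * 60 + int(secs))
    return seconds


def find_timestamp_problems(transcript: str, audio_duration_s: float) -> list[str]:
    """Flag timestamps that are impossible: past the end of the audio or out of order."""
    problems = []
    stamps = parse_timestamps(transcript)
    for i, t in enumerate(stamps):
        if t > audio_duration_s:
            problems.append(f"entry {i}: {t}s is past the end of the audio ({audio_duration_s}s)")
        if i > 0 and t < stamps[i - 1]:
            problems.append(f"entry {i}: {t}s goes backwards from {stamps[i - 1]}s")
    return problems


if __name__ == "__main__":
    # Made-up transcript for a 60-second clip, purely for illustration.
    sample = "[00:05] hello\n[00:03] world\n[02:00] end"
    for problem in find_timestamp_problems(sample, audio_duration_s=60):
        print(problem)
```

This only catches timestamps that cannot be right. Timestamps that are plausible but wrong would still need to be compared against a known-good reference, such as the output for the same content submitted as video.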

    Labels

    component:model (The issue is related to the Gemini Models)
    status:triaged (Issue/PR triaged to the corresponding sub-team)