Automatically add paragraphs and line breaks to your transcripts to improve readability.
Raw ASR output is often a dense wall of text, which makes it hard to follow the flow of a conversation. The Automatic Formatting feature addresses this by inserting newlines and double newlines into your transcript based on the duration of pauses between spoken words. This transforms a hard-to-read block of text into a well-structured, easy-to-scan document.
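newline_pause_threshold: The pause duration (in seconds) required to insert a single newline (\n), creating a line break.
double_newline_pause_threshold: The pause duration (in seconds) required to insert a double newline (\n\n), effectively creating a new paragraph. This is useful for marking a change in topic or speaker.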
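To make the threshold behavior concrete, here is a minimal sketch of pause-based formatting. It is an illustration only, not the service's actual implementation: the word representation (text, start time, end time), the function name `format_by_pauses`, and the strict "greater than" comparison are all assumptions.

```python
from typing import List, Tuple

# Hypothetical word representation: (text, start_seconds, end_seconds).
Word = Tuple[str, float, float]


def format_by_pauses(
    words: List[Word],
    newline_pause_threshold: float = 0.8,
    double_newline_pause_threshold: float = 1.5,
) -> str:
    """Join words, choosing the separator from the pause before each word."""
    parts: List[str] = []
    for i, (text, start, _end) in enumerate(words):
        if i > 0:
            # Pause = gap between the previous word's end and this word's start.
            pause = start - words[i - 1][2]
            # Assumption: a pause must exceed the threshold (strictly greater).
            if pause > double_newline_pause_threshold:
                parts.append("\n\n")  # long pause: new paragraph
            elif pause > newline_pause_threshold:
                parts.append("\n")  # medium pause: line break
            else:
                parts.append(" ")  # short pause: same line
        parts.append(text)
    return "".join(parts)


# Made-up timings for illustration.
sample = [
    ("Hello", 0.0, 0.4),
    ("team.", 0.5, 0.9),
    ("Next", 1.9, 2.2),   # about 1.0 s pause: line break
    ("item.", 2.3, 2.7),
    ("Done.", 4.5, 4.9),  # about 1.8 s pause: new paragraph
]
print(format_by_pauses(sample))
```

The larger threshold is checked first so that a long pause produces a paragraph break rather than a single line break.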
The formatting parameter must be sent as a JSON-formatted string within your multipart/form-data request, not as a raw JSON object. We’ll show you how to do this below.
Applying formatting makes a huge difference in readability.
Before Formatting
Alright team let's sync up on the Q3 project deliverables for Fennec aural. The primary goal is to finalize the user interface mockups by Wednesday. I've finished the preliminary analysis for the core features and have the numbers ready. We need to ensure that the new design is both intuitive and accessible. I'm reviewing the data on the acting ink report now. Great, pull them up. We need to finalize the presentation by tomorrow. The client expects a full walkthrough.
After Formatting
Alright team let's sync up on the Q3 project deliverables for Fennec aural. The primary goal is to finalize the user interface mockups by Wednesday.
I've finished the preliminary analysis for the core features and have the numbers ready. We need to ensure that the new design is both intuitive and accessible.
I'm reviewing the data on the acting ink report now.
Great, pull them up. We need to finalize the presentation by tomorrow. The client expects a full walkthrough.
In your script, create a dictionary with your desired pause thresholds.
# Define the formatting rules in a Python dictionary
# Add a newline for any pause over 0.8 seconds
# Add a new paragraph for any pause over 1.5 seconds
formatting_options = {
    "newline_pause_threshold": 0.8,
    "double_newline_pause_threshold": 1.5
}
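Because the formatting parameter must travel as a JSON-formatted string, the dictionary needs to be serialized before it goes into the form data. The sketch below shows one way to do that with the standard library. The helper name `build_form_fields` is hypothetical, and the endpoint, authentication, and audio file field are left out because they are not specified here.

```python
import json


def build_form_fields(formatting_options: dict) -> dict:
    """Return form fields with the formatting rules serialized as a JSON string."""
    # json.dumps turns the dictionary into text, which is what a
    # multipart/form-data field can carry; a nested dict would not be
    # sent as JSON.
    return {"formatting": json.dumps(formatting_options)}


form_fields = build_form_fields(
    {"newline_pause_threshold": 0.8, "double_newline_pause_threshold": 1.5}
)
print(form_fields["formatting"])
# {"newline_pause_threshold": 0.8, "double_newline_pause_threshold": 1.5}
```

With an HTTP client such as `requests`, a dictionary like `form_fields` would typically be passed as the `data` argument alongside the uploaded file in `files`.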