AI Video Editor Showdown (Short-Form Test): Timebolt vs Descript vs Loom
Sep 14, 2025
Last Update: Sept 14, 2025
Watchability is everything in short-form video
When you’re sending a quick message, posting a reel, or sharing a clip, you’ve got only a few seconds to earn attention. If the first 15 seconds of your video stretch to 23 because of pauses, ums, and filler, that’s a 53% increase in runtime.
In this Part One of our AI Video Editor Showdown, we tested TimeBolt, Descript, and Loom on a 93-second raw recording filled with silences and hesitations. The goal: see which tool keeps your message sharp and your audience engaged.
For Part Two (Long-Form Test: TimeBolt vs Descript vs Gling), we scaled the test to a 60-minute Zoom recording to see how small misses compound into wasted minutes and hours.
The verified results and files are presented below so others can reproduce the test.
FOR CONTEXT, HERE ARE THE PLAYERS
-
Descript has raised around $100 million from investors that include OpenAI Startup Fund, Andreessen Horowitz, Redpoint, and Spark Capital.
-
Loom Acquired by Atlassian for $975 million in 2023.
-
TimeBolt is rapid video communications software, bootstrapped since 2019, no outside funding.
Here are the test results side-by-side.
Baseline: Raw Video
- Length: 93 seconds
- Dead air (defined as pauses longer than 0.5 seconds): 75.6 seconds
- Filler words: 20
Filler words included common hesitations such as “um,” “yeah,” “so,” “you know,” as well as combinations like “and um” and “oh shit.” These counts were confirmed using Timebolt’s Umcheck analysis tool.
The Script
“You’ve heard the pitch a thousand times.
AI edits your video in minutes.
But here’s the problem with Loom and most so-called AI editors.
They promise to cut the boring parts.
But if it butchers the meaning, it’s worse than doing nothing.
And that’s already annoying.”
Timebolt Results
- Settings: remove silences ≥0.5 seconds, ignore <0.75 seconds, 0.01-second left padding, 0.15-second right padding
- Final video length: approximately 14.9 seconds
- Dead air missed: 0.7 seconds
- Filler words missed: 0
Timebolt removed all identified filler words and nearly all silences, producing an output that closely matched the intended edit.
Descript Results
- Settings: Remove all filler words (with 'Avoid Harsh Cuts' turned off), and remove gaps greater than 0.5 seconds, shortening them to 0.5 seconds.
- Final video length: Approximately 23.2 seconds.
- Dead air missed: Around 3.8 seconds of explicit gaps, with an effective total of approximately 6.9 seconds when including the durations of retained fillers. This is lower than our reported 9 seconds, suggesting our study included internal pauses or used a slightly different threshold.
- Filler words missed: 12 (e.g., "and," "uh," "Yeah," "So," "like," "oh shit"). This is higher than our reported 10, showing incomplete removal despite the aggressive settings.
Descript handles silences reasonably well but retains more fillers than expected, possibly due to its audio detection limits without manual tweaks. The extra length compared to Timebolt comes from these retained elements and short unremoved pauses.
Loom Results
- Settings: Loom offers no fine-tuning options, so we used its defaults: Remove silences (reported 15 seconds removed) and remove fillers (reported 24 removed).
- Final duration: 65 seconds
- Dead air missed: 35 seconds, nearly 1/2 the video
- Fillers words missed: 6 (e.g., "And," "So," "Yeah.," "Oh shit."). Loom reported removing 24, which overcounts the raw 20, likely flagging non-fillers like "and" as repeats.
Loom’s counters seem off: over-removing fillers with false positives while missing significant silence. The retention of "Oh shit" (listed as a filler) suggests Loom’s detection skips custom phrases, focusing only on basics like "um" or "ah."
Summary of Findings
- Timebolt achieved near-perfect removal of silence and filler words.
- Descript was effective but left some filler content and more silence.
- Loom lagged behind, with significant uncut pauses and less precise filler detection.These differences are likely due to the underlying approaches of each tool: waveform-based detection in Timebolt versus transcript-based or fixed algorithms in the others.
Tool |
Final Duration |
Filler Missed |
Silences Missed |
Resolution |
|
Raw |
90s |
20 |
76s |
NA |
|
TimeBolt |
15s |
0 |
0 |
3840 x 2160 |
|
Descript |
23s |
9 |
9s |
3840 x 2160 |
|
Loom |
65s |
4 |
35s |
2560 x 1440 |
|
Efficiency Impact
When comparing short-form edits, percentages matter more than raw seconds. Using Timebolt’s 14.9-second output as the baseline, extrapolating proportionally for a 1-hour raw video (~3,600s, typical for unscripted commentary) or 2 hours shows how this scales into a nightmare.
Assumptions: Tools maintain similar removal ratios (real results vary by content, but this illustrates the filler burden).
Raw Video Length | Tool | Final Duration | Extra Time vs TimeBolt | Impact |
---|---|---|---|---|
1 Hour | TimeBolt | 9.6 min | Baseline | Punchy, optimized for YouTube retention |
Descript | 15.0 min | +5.4 min (+56%) | Noticeable drag; 60–120 min of manual fixes per hour | |
Loom | 42.0 min | +32.4 min (+337%) | Bloated; hours of extra post-production, 20–30% retention hit per +10 min | |
2 Hours | TimeBolt | 19.2 min | Baseline | Still efficient for upload and algorithm boosts |
Descript | 30.0 min | +10.7 min (+56%) | Compounds to ~4 hours of cleanup for a full session | |
Loom | 84.0 min | +64.8 min (+337%) | Unusable without heavy manual intervention |
The effect compounds with longer projects. For every “clean” hour of content, Descript leaves roughly an additional 30–35 minutes to fix manually. Loom can add more than an hour of excess runtime. Seconds matter in short-form video, and when scaled to long recordings, small misses turn into hours of extra edit time.
Why the Results Matter
Our study stands up to independent scrutiny. Timebolt leads with near-perfect removal of silence and fillers, Descript is a solid contender but misses more fillers, and Loom lags with inconsistent counters and excessive silence. These differences likely stem from tool-specific algorithms and settings.
Timebolt’s customizable options, possibly leveraging precise waveform analysis for audio cuts, give it an edge over Descript and Loom, which may rely more on transcript-based detection and thus miss nuanced pauses or filler variations.
Timebolt’s approach allows for comprehensive cleanup without extra effort, while Descript suits those who want to refine edits manually, and Loom works for simple screencasts but needs oversight for accuracy.
Reproducibility
To allow others to confirm or challenge these results, we are providing the materials used:
- Download Raw Test Script Video
- Raw Word-Level Data (Umcheck JSON): The JSON array provided, with start/end timestamps for each word.
- Timebolt SRT Output: The JSON array from our processed file.
- Descript SRT Output: The JSON array from the reviewer’s processed file.
- Loom SRT Output: The JSON array from the reviewer’s processed file.
- TimeBolt Silence Detection Settings: Remove silence longer than .5, Ignore Detections Shorter than .75 sec, Left Padding .01, Right Padding .15
- Descript Detection Settings: Selected Remove All Filler words (Avoid Harsh Cuts - turned off), Remove Gaps greater than .5, Shorten to .5
- Loom Detection Settings: Unable to fine-tune settings. Remove Silences on = reported 15sec, Remove Filler On, shows ’24 Filler Words removed
Filler Words Measured:
um, ah, huh, and so, so um, uh, and um, like um, so like, like it's, it's like, i mean, yeah, ok so, uh so, so uh, yeah so, you know, it's uh, uh and, and uh, oh shit
Anyone can repeat the test with these files. If your results differ, we encourage you to share them so the comparison can be refined further.
Conclusion
The independent review confirms that Timebolt offers the most accurate automated cleanup among the tools tested. Descript provides a strong alternative, especially for users who prefer to fine-tune their edits manually. Loom remains a convenient option for simple screen recordings but does not provide the same level of precision.
This verification strengthens our original conclusion: for creators seeking reliable automated editing, Timebolt delivers the most consistent results.
Disclaimer: The results of this study are based on tests conducted and verified as of September 14, 2025. Technology and software, including Timebolt, Descript, and Loom, may change over time, potentially affecting performance. We encourage users to test these tools with the latest versions and share their findings for ongoing accuracy.