
The most well-known open-source video editor on the Linux desktop He makes another move.. Shotcut 25.10 It's already here and comes loaded with features that directly target productivity: local artificial intelligence, HTML generators for stylish titles, and native screen recording that avoids relying on external utilities.
After a beta testing period, the release as a stable version confirms that the project is progressing steadily according to its roadmap. The star innovation is text-to-speech (TTS) for notes and subtitles, accompanied by a typewriter filter, an image/video generator from HTML and the update to FFmpeg 8, not forgetting a good package of fixes and compatibility improvements.
Overview: What's changing in Shotcut 25.10
The team has been incorporating AI features in recent times: first came transcription with Whisper (voice-to-text) and now the circle is completed with integrated and local text-to-speechThis combination opens up very powerful workflows for narrating, subtitling, and making content accessible without leaving the editor.
Along with AI, this installment introduces creative tools supported by HTML and features requested by the community, such as cross-platform screen recording. The release also enhances the technical foundation with updated FFmpeg 8 and key libraries.
Shotcut 25.10 introduces local text-to-speech for notes and subtitles
The implementation of TTS in Shotcut relies on KokoroDoki and the Kokoro 82M modelIt's an open-source speech synthesis engine that can work in real time. It's worth noting that version 25.07 incorporated a model of speech-to-textThe most notable feature is that everything happens on your computer: there is no transmission to the cloud, so privacy and control of the project are guaranteed.
To speed up processing, KokoroDoki can use NVIDIA CPU or GPU with CUDAOn machines with compatible graphics cards, you'll notice substantially shorter generation times, which is appreciated when iterating over many lines of subtitles or when preparing long voiceovers.
Regarding the voices, the current package is more complete in English. American English has more than twenty voices.The British version offers around eight languages, and coverage for other languages ​​is more limited for now. It's expected that the range will expand in future revisions, but initially there's already enough material for models, prototypes, and final projects in English.
There is one technical detail to keep in mind: The integration is performed using Docker.This adds an extra step in setup compared to other features, especially if you've never worked with containers before. It's not an insurmountable obstacle, but it's worth setting aside a few minutes to fine-tune everything.
Shotcut 25.10, generating from HTML and typewriter effect
Creativity expands with two contributions that go hand in hand: a new "Image/Video from HTML" generator and a text effect that simulates a typewriter. The approach is clear: to bring browser capabilities to the timeline to produce titles, captions, and animations with transparent backgrounds and without any unusual plugins.
The HTML generator creates Images or clips with transparency by defaultTo maintain stability, the generated videos are limited to 15 frames per second, a sufficient value for graphics and overlays that avoids bottlenecks. The result is immediately visible. It opens automatically in the Origin viewer along with its HTML, so you can review or tweak the code on the fly.
The "Text: Typewriter" effect includes ready-to-use presets and others designed for customization. Among the included profiles are: 3D image, elastic stroke, folded, gold metal o party timeThese tools cover everything from solid image finishes to more playful video animations. If you're comfortable with CSS and JS, you have the opportunity to create very serious work.
- Multi-device Compatibility Code compiled from CodePen, ideal for reusing snippets and designs you already have in your stash.
- Render with a limit of 15 FPS for stability in browser-based generation.
- Creative possibilities with transparent backgrounds without complicated compositions.
Native and cross-platform screen recording
Another classic community request has finally arrived: integrated screen recordingThe implementation varies depending on the environment, but in all cases the most reliable native backend has been sought to reduce friction.
- X11- Using FFmpeg as main backend.
- Wayland (GNOME): integration with the native capture/screencast tool.
- KDE Plasma: support in Spectacle for capture.
- AlternativeIf none of the above are present, then resort to OBS Studio as a wild card.
This avoids installing additional utilities in most cases and unifies the workflowYou record, edit, and export all from the same application. For creators of tutorials, software reviews, or training sessions, this is an improvement that's noticeable from day one.
Shotcut 25.10 updates the multimedia engine: FFmpeg 8 and libraries
There's movement under the hood too. The app switches to ffmpeg 8This means improvements in encoding/decoding, new filters, and broader compatibility with modern formats. On a practical level, this translates to more consistent exports and smoother playback of complex projects.
Essential components are also being updated, such as SVT-AV1, libaom, dav1d, libvpx, libwebp y whisper.cppThese pieces affect AV1, VP9, ​​WebP codecs or the AI ​​transcription itself, closing the circle for more robust workflows.
Shotcut 25.10 interface and workflow improvements
Small details that add up: the options are now available in the Properties panel "Open with" and "Recharge"These are modest but practical changes when you need to refresh externally edited media or launch it quickly with a system app.
The new "Image/Video from HTML" generator It coexists with the "Text: Typewriter" filter, so you can combine HTML generation outside the timeline (as a media asset) with the typewriter animation applied to an already inserted clip. Total flexibility depending on the use case.
Shotcut 25.10 bug fixes and stability improvements
The release fixes several annoying problems that had been reported, similar to the 25.08 update. For example, Export failures when the name or path contained the ampersand (&) character, a more common problem than it seems in shared projects.
It also solves a Problem starting AppImage on Linux when AppImageLauncher was installed, and they are fixed advanced keyframes in «Text: Rich» who were not behaving as they should. Finally, a Error in the alpha channel when decoding Ut Video, key for those who work transparently in professional pipelines.
Installation and compatibility
To make life easier in Linux, the project distributes a Universal AppImage which you can download and run without installing dependencies. In practice, it's just a matter of giving it execute permissions and you're done.
On the compatibility front, support is confirmed for glibc 2.35 on Ubuntu 22.04 LTSTherefore, that LTS version is covered. Any modern distribution with kernel higher than 5.10 It should go smoothly, whereas on older systems you may need to adjust libraries.
Use cases: from creators to educators and developers
For those who create content on social media or YouTube, the package is perfect. You record the screen without leaving the editorYou apply stylish text effects and generate synthetic voices for intros, explanations, or calls to action without having to narrate each change.
- Video tutorials: integrated recording + TTS annotations for dynamic rhythms.
- PresentationsProfessional animated titles without paid plugins.
- AccessibilityBy combining Whisper (voice to text) and TTS you can improve subtitling and narration.
In education and training, the value is clear: recorded classes, video tutorials and supporting material with locally generated labels and voiceovers. Several English voice profiles allow for varying the tone depending on the audience.
- Software demosNative application capture without relying on external tools.
- Visual documentationVideos with technical annotations and readable bursts of text.
- Interface testing: record and review workflows for quick feedback.
Quick comparison with the previous version
Compared to the previous version, the improvements are substantial. The native recording It eliminates the need for additional solutions on most desktops, and the HTML+Typewriter combination multiplies the options for titles and lower thirds.
- ✅ Integrated recording without going through OBS (except in alternative scenarios).
- ✅ Advanced text effects with HTML/CSS support and quality presets.
- ✅ Text to speech for notes and subtitles within Shotcut itself.
- ✅ ffmpeg 8 for better performance and current formats.
- ✅ Enhanced stability with fixes in common edge cases.
Requirements and recommended configuration
For everything to flow smoothly, it's best to aim for a balanced team. With 8 GB of RAM you can workBut if you work on medium/large projects, you'll appreciate having 16 GB.
- CPU: at least 4 cores for effects and loose encodings.
- RAM8 GB minimum, 16 GB recommended.
- GPU: Compatible with OpenGL 3.3+; if you use CUDA, it will accelerate TTS with KokoroDoki.
- StorageSSD for media and cache, especially in projects with heavy codecs.
In project settings, it's a good idea match FPS and resolution to your final destination to avoid unnecessary rescaling. Maintaining proxies and optimized preview will save you from lag when adding filters and multiple tracks.
Workflows with HTML, CodePen, and Typewriter
If you're going to really push the HTML generator, Work on the design in your editor or on CodePen.Compile/package the resources and paste them into the Shotcut template. This ensures that fonts, styles, and scripts are resolved without external calls.
For text animations, alternate the filter "Text: Typewriter" And external generation: when you want speed and something lightweight, the filter gets you out of trouble; if you need complex finishes, the HTML flow will give you more control (and native transparency).
Notes on performance, privacy, and limits
The local approach to TTS has two major advantages: protect your content It avoids latency or online service fees. In return, it requires Docker configuration and, for maximum speed, an NVIDIA GPU with CUDA capabilities.
The limit of 15 FPS in HTML generation It's not just a whim: it stabilizes rendering when the browser engine generates video or images with transparencies and layers. In practice, it's more than enough for titles, transitions, and overlays.
Small details that make a difference
The option "Open with" in Properties It speeds up editing a clip in an external tool and going back without losing the thread, while "Recharge" It allows you to force an update of media if the file has changed on disk. Those two keys, plus the Source viewer that automatically opens HTML-generated content, make everyday tasks much easier.
The sum of FFmpeg 8 and the revamped libraries It also translates into peace of mind: fewer surprises when importing/exporting and better handling of recent formats like AV1, along with finer decoding.
Those coming from previous versions will find an editor that It has matured without losing its essenceIt's free, cross-platform, and features technical design choices that favor local development. The feeling is that the project is hitting the gas where it needs to: performance, practical AI, and creative tools with gentle learning curves.
