Get subtitles and transcripts with our audio to text converter
Easily transcribe audio to text with Amberscript. Our audio to text converters supports multiple audio and video file formats which you can use to create high quality subtitles and transcripts.

Find out which file formats we can transcribe audio to text
Get subtitle and transcripts using our audio to text converter!
Click on the supported file format for more information
Supported audio file formats
-
AAC
Advanced Audio Coding is a lossy digital audio compression format, which you can upload to the audio to text converter. Originally this file format was supposed to replace the MP3 format, as it offers higher sound quality at the same bit rate. Read more on how to convert AAC to text.
-
DSS
Digital Speech Standard developed in 1994 to store voice audio data in a highly compressed format. Used primarily for digital dictation recorders, thanks to basic recording functionality and the ability to enter into overwrite mode.
-
FLAC
Free Lossless Audio Codec is a lossless compression format of digital audio. Files compressed using this format can be reduced by 50 to 70 percent of their original size and contain an identical copy of the original audio data. More about converting FLAC to text here.
-
GSM
GSM is the file extension used for audio files that store strong digital audio data on a computer or mobile device. This file format is generally used by cell phones.
-
MP2
An MP2 file is an audio file compressed using MPEG-1 Audio Layer II compression. This type of file is still relatively standard in the realms of digital radio and television broadcasts. However, for many individuals, MP3 files are used with much more frequency.
-
M4A
It’s a MPEG-4 audio file encoded with AAC or ALAC (Apple Lossless Audio Codec). Most commonly used for audio content like songs, podcasts, or audiobooks. Convert the M4A file to text.
-
MP3
MPEG-1/2 Audio Layer III is one of the most popular coding formats for digital audio. Uses lossy data compression. This allows a significant reduction of file size compared to the source audio. More information about converting MP3 to text is here.
-
MPA
MPA files are audio files compressed using MPEG-2 compression. Using this compression method allows users to reduce the file size of an audio file while preserving the overall sound quality. MPA was formerly based on the MPEG Layer I coding standard but has been updated to MPEG Layer II and later MPEG Layer III.
-
WAV
Waveform Audio File Format was developed by IBM and Microsoft and originally was used on Microsoft Windows systems for storing raw and uncompressed audio. Convert your WAV to text.
Video file formats
-
MV4
This is a video container format developed by Apple, and quite similar to the MP4 format. Originally used in iTunes Store, as such files may be protected by DRM. Learn how to convert MV4 to text here.
-
MOV
QuickTime File Format that was originally used by QuickTime framework. It can contain multiple tracks, where each of them stores a different types of data: audio, video, or text. Learn how to convert MOV to text here.
-
WMV
A WMV file is a video saved in the Microsoft Advanced Systems Format (ASF) and compressed with Windows Media Video (WMV) compression.
-
AVI
An AVI file is a video file saved in the Audio Video Interleave (AVI) multimedia container format created by Microsoft. It stores video and audio data encoded in various codecs, including DivX and XviD.
-
MP4
It’s a MPEG-4 Part 14 digital multimedia container format that can store video, audio, and even subtitles, or still images. Learn how to convert MP4 to text here.
-
TS
A TS file is a type of video stream file. It’s used to store video content on DVD and Blu-ray discs. It makes use of standard MPEG compression in order to compress video data, mostly being used for saving streamed or broadcast video content.
-
FLV
FLV files are video files saved in the Adobe Flash Video (FLV) container format. In this video format, each file consists of a short header, synchronized audio and video data streams (encoded like the standard . SWF format used in Adobe Flash), and metadata packets.
Audio/Video file formats
-
MPEG
This format was developed by The Moving Picture Experts Group and is used in various multimedia systems. There are multiple MPEG standards, but the most popular ones are MPEG-1, MPEG-2, and MPEG-4. Read how to convert your MPEG file to text.
-
MPG
One of many of file extensions for MPEG-1 or MPEG-2 audio and video compression. Convert your MPG to text with Amberscript.
-
OGG
It’s a free, open container format. This means it can carry streams for audio, video, text (subtitles or captions), and metadata. Click here to find out how quickly you can convert your OGG file to text.
-
3GP
The Third Generation Partnership Project (3GPP), 3GP is a video file format supported in 3G enabled devices. It contains video streams of H. 263 or H. 264 codec and AMR or AAC based audio streams, which are perfect for lossy compression of video data.
-
VOB
Video Object – container format for DVD-Video. Typically can be found on the DVD disc. It’s based on MPEG format, but such files can be encrypted. Larger pieces of multimedia saved in VOB format are split into 1 GB files, to make sure all operating systems can read them.
-
WMA
Windows Media Audio was developed by Microsoft, which was a part of the Windows Media framework. Primarily used for encoding only audio files, as metadata may include song name, track number, and artist name. Today encoding video in this format is also possible. Convert your WMA to text.
-
MXF
MXF is an industry-standard file format for video and audio. Like QuickTime files, MXF files contain information about the media inside. This information, also called metadata, can include frame rate, frame size, creation date, and custom data added by a camera operator, an assistant, or an archivist.
-
WebM
WebM is an audiovisual media file format. It is primarily intended to offer a royalty-free alternative to use in the HTML5 video and the HTML5 audio elements.
How to transcribe audio to text

1. Upload your audio files
Create a free account and upload your audio file to Amberscript, this acts as your audio to text converter. Select the language of the audio and the number of speakers present. Our audio to text converter recognizes speech in 39 languages, and supports multiple file formats like MP3, AAC, WAV.
Try it free
2a. Edit the text yourself
The automatic speech recognition (ASR) software listens to the audio and converts it to text. Depending on the files audio quality this will be up to 85% accurate, you can then perfect the text yourself using the easy to use online editor. Even better, you can highlight the most important parts, rename your speakers and adjust the timestamps. This is 5-10 times faster than listening to the audio and typing the text yourself.
Start now
2b. Choose Human Made services
If you have a high volume of audio or simply do not have time to review the text yourself, order the Human made services and receive your files ready in a few days. This option grants you 100% accurate audio to text and is completely handled by our team of professionals. Rush order and translated subtitles are also possible.

3. Export your audio/video file to your favourite text format
We support multiple text formats you can export and download your file in:
- .docx – Microsoft Word file format
- .json – JavaScript Object Notation
- .srt – SubRip Subtitle file
- .vtt – Web Video Text Tracks
- EBU-STL – European Broadcast Union subtitles
- .txt – Plain text file
Find out more
Converting audio or video with ease
Read more about how to convert audio or video file to text
Try it freeAutomatically convert your audio to text using our high-end AI engines.
Read more