Aviary Features

Automated Metadata

Last edited 4 days ago by Jesse Moore
Automated metadata overview
Aviary lets organizations generate automated transcripts and translations for resources directly in the platform, using Whisper, IBM Watson, Deepgram, AssemblyAI, or Trint. The following sections of the Automated Metadata user guide explain how to request automated transcripts and translations: selecting an automated service, enabling transcription features, paying for automated metadata transactions, and the organizational approval process. Note: Media files must be hosted by Aviary to request automated metadata services. Content embedded in Aviary (e.g., media files embedded from YouTube, Avalon, Vimeo, or URLs) cannot be sent to automated services.
Automated metadata payments
Payments for automated metadata requests are processed in Aviary. Organizations can pay for each automated metadata request individually or purchase Prepaid Automated Metadata Credit that is applied to requests on an ongoing basis.
1. Organization Owners can review and update payment information on the Manage Account page. To open the Manage Account page, select the “Account” tab in the staff menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/888e39f7-cc48-4d70-926e-b7ff986421c8/user_cropped_screenshot.jpeg?tl_px=0,784&br_px=529,1080&force_format=jpeg&q=100&width=529&wat_scale=47&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=62,205
2. The Manage Account page will open and display the Account Information tab. Click the “Payment Information” tab to review, add, or update payment information.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/106aa06a-0e57-45dc-8282-6046b1d848dd/user_cropped_screenshot.jpeg?tl_px=0,114&br_px=1146,755&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=502,164
3. Click the “Add Payment Method” button to add a credit card to your Aviary organization.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/35e6e509-0dc6-45c6-a157-485f1fc8cda5/user_cropped_screenshot.jpeg?tl_px=200,0&br_px=1919,839&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=969,59
4. The Payment Method modal window will open. Complete the billing information and click the “Submit” button.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/82af17bb-78a0-494c-a5a8-40ce97e5ef54/user_cropped_screenshot.jpeg?tl_px=243,103&br_px=1619,872&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=701,522
5. To purchase prepaid automated metadata credit, return to the Account Information tab and click the “Purchase Credit” button.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/816b718d-ec0d-4e48-913a-a199930b2881/user_cropped_screenshot.jpeg?tl_px=0,167&br_px=1146,808&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=677,469
6. The Prepaid Automated Metadata Credit modal window will open. Use the Credit Amount dropdown menu to select the amount of prepaid automated metadata credit to purchase. Credit is available in $50 increments between $50 and $1,000 USD. Click the “Purchase Credit” button to complete the transaction.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/e2ce8ef9-125b-4b9f-b89f-a4f600ea61a6/user_cropped_screenshot.jpeg?tl_px=446,118&br_px=1429,667&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=640,456
7. The Prepaid Automated Metadata Credit will be applied to the organization and displayed on the Account Information tab. To review the organization’s purchase history, click the “Billing History” tab.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/c63defea-559a-4d4b-bb9a-e93f9d3cf11f/user_cropped_screenshot.jpeg?tl_px=0,107&br_px=1376,876&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=521,139
8. The Billing History tab will display the organization’s previous transactions, including prepaid automated metadata credit purchases and the charges applied for each transcription request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/1da8696a-041f-41e5-8286-34c4e637c8a0/user_cropped_screenshot.jpeg?tl_px=0,187&br_px=1376,956&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=471,277
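The purchase step above says credit is sold in $50 increments between $50 and $1,000 USD. As a planning aid only, the short Python sketch below lists the amounts that rule implies and picks the smallest one that covers an estimated cost. The function names and the idea of an "estimated cost" are hypothetical and are not part of Aviary; actual prices are shown in the Request Transcript modal.

```python
# Planning sketch only: these names are illustrative and not part of Aviary.
# The limits come from the guide: $50 increments between $50 and $1,000 USD.
CREDIT_MIN_USD = 50
CREDIT_MAX_USD = 1000
CREDIT_STEP_USD = 50


def valid_credit_amounts():
    """Return every purchasable credit amount implied by the guide, in USD."""
    return list(range(CREDIT_MIN_USD, CREDIT_MAX_USD + 1, CREDIT_STEP_USD))


def smallest_credit_covering(estimated_cost_usd):
    """Return the smallest single purchase that covers the estimate.

    Returns None when the estimate exceeds the largest single purchase,
    in which case more than one purchase would be needed.
    """
    for amount in valid_credit_amounts():
        if amount >= estimated_cost_usd:
            return amount
    return None


if __name__ == "__main__":
    print(valid_credit_amounts())
    print(smallest_credit_covering(120))
```

For example, an organization expecting roughly $120 of transcription requests would choose the $150 option under this rule.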
Whisper transcription
Automated speech-to-text transcription can be requested directly from Aviary using OpenAI's Whisper service. Whisper supports features including:
Sentence/phrase level transcription
Skip common words: Omits filler words (like “umm”) by default to improve readability
Proper noun prompts: Users can provide up to 244 characters of prompts to improve transcription of proper nouns
Supports almost 100 languages: See Whisper language support for more information
Requesting an automated transcript from Whisper:
Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve pending requests. See the Approving automated metadata requests page for steps to approve pending requests.
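Because the proper noun prompt is limited to 244 characters, it can help to prepare the list before pasting it into Aviary. The Python sketch below is one possible way to do that, not an Aviary feature. It assumes the names are joined with a comma and a space (the guide does not specify a separator), and `build_prompt` is a hypothetical helper name.

```python
# Sketch only: build_prompt is a hypothetical helper, not part of Aviary or Whisper.
# The 244-character limit comes from the guide; the ", " separator is an assumption.
MAX_PROMPT_CHARS = 244


def build_prompt(proper_nouns, limit=MAX_PROMPT_CHARS):
    """Join names in priority order, stopping before the limit is exceeded."""
    prompt = ""
    for noun in proper_nouns:
        noun = noun.strip()
        if not noun:
            continue  # ignore blank entries
        candidate = noun if not prompt else f"{prompt}, {noun}"
        if len(candidate) > limit:
            break  # keep the earlier, higher-priority names and stop here
        prompt = candidate
    return prompt


if __name__ == "__main__":
    names = ["Aviary", "IBM Watson", "Deepgram", "AssemblyAI", "Trint"]
    text = build_prompt(names)
    print(text, len(text))
```

Listing the most important names first means they are the ones kept if the list is too long to fit.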
Whisper translation to English
It’s possible to request automated English translations for transcripts in Aviary using OpenAI's Whisper service. Whisper supports features including:
Skip common words: Skips common words (like “umm”) by default to improve readability
Proper noun prompts: Users can provide up to 244 characters of prompts to improve transcription for proper nouns
Supports almost 100 languages: See Whisper language support for more information
Note: Whisper does not currently provide consistent results when translating from Spanish to English. Please be aware that automated translation results may contain inconsistencies.
Requesting an automated translation from Whisper:
1. Navigate to the Resource Detail page for the resource to be translated. If more than one media file is associated with the resource, select the media file to be translated in the Media File carousel below the embedded media player.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/7be01296-bc04-4f5f-aa80-fc3809975a5f/ascreenshot.jpeg?tl_px=6,179&br_px=1153,820&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=607,46
2. Select the “Transcript” tab at the top of the Resource Detail page.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/aaa5b929-4107-48b4-867a-7350a83b8674/ascreenshot.jpeg?tl_px=773,226&br_px=1920,867&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=601,70
3. Click the vertical ellipsis (⋮) in the Transcript menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/fa96f361-47c5-4383-95ba-ef5fce610d55/user_cropped_screenshot.jpeg?tl_px=1045,230&br_px=1905,710&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=778,150
4. Click “Translate to English - Whisper” in the dropdown menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/6cf64f67-f4ff-4cec-9068-25a90267792c/user_cropped_screenshot.jpeg?tl_px=773,250&br_px=1920,891&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=929,268
5. Enter up to 244 characters of proper nouns to improve the translation.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/1a75b300-45f5-429c-b31d-85eb41077cbf/user_cropped_screenshot.jpeg?tl_px=500,118&br_px=1360,599&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=254,178
6. Review the duration of the media file and the price of the transcription job. Click the “Create Transcription Job” button to submit the automated translation request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/437e8860-5d15-4361-85be-a1c3d3410e7d/user_cropped_screenshot.jpeg?tl_px=514,120&br_px=1374,600&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=523,400
7. A confirmation message will be displayed on the Resource Detail page after submitting the transcription request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/378fe884-210c-45cf-b451-4360e32c4f34/ascreenshot.jpeg?tl_px=1369,116&br_px=1898,411&force_format=jpeg&q=100&width=529&wat_scale=47&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=139,67
Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for steps to approve pending requests.
IBM Watson transcription
It’s possible to request automated speech-to-text transcription directly from Aviary using IBM Watson. IBM Watson supports features including: Word-level transcription Speaker diarization: Differentiates between speakers in the transcript Smart formatting: Generates special formatting for dates, times, series of digits and numbers, phone numbers, currency values, internet email, and web addresses Profanity filter: Censors profanity from the transcript Speaker hesitation markers: Add or remove speaker "%HESITATION" markers in the transcript Supports multiple languages: See IBM Watson language support for more information Requesting an automated transcript from IBM Watson: Navigate to the Resource Detail page for the resource to be transcribed.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/cda3d429-e233-49a7-bb0e-b26c70d66d95/ascreenshot.jpeg?tl_px=0,0&br_px=1376,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=508,93 If more than one media file is associated with the resource, select the media file to be transcribed in the Media File carousel. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/244ee06c-fec8-447e-8672-1480b63d8b26/ascreenshot.jpeg?tl_px=0,198&br_px=1376,968&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,371 Select the “Transcript” tab at the top of the Resource Detail page. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/e72d42dc-c124-4366-8b37-6eb83de2f5cc/ascreenshot.jpeg?tl_px=751,238&br_px=1898,879&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=605,524. Click the vertical ellipsis (⋮) in the Transcript menu. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/a595ad33-49dc-4b18-9589-ef8ef7d043c6/ascreenshot.jpeg?tl_px=931,307&br_px=1914,856&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=891,80 5. Click “Request new transcript” in the dropdown menu. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/80598ddc-cb1b-4582-8a02-44438afad110/ascreenshot.jpeg?tl_px=937,306&br_px=1920,855&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=805,123 6. Select “IBM Watson” from the list of transcription services. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/98fd49d4-c78b-4cf4-83ba-a65853e935a7/ascreenshot.jpeg?tl_px=937,303&br_px=1920,852&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=729,198 7. The “Request Transcript” modal window will open. Select the language to be transcribed in the Language dropdown menu. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/2fa43f6f-8531-4a74-bb14-73f0e537e8d3/ascreenshot.jpeg?tl_px=353,129&br_px=1500,770&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=603,98 8. Check the "Speaker Diarization" checkbox to enable speaker identification. This feature will differentiate between speakers in the transcript. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/781bd50c-8e2e-4c5f-bab8-cb677cf39888/ascreenshot.jpeg?tl_px=357,127&br_px=1504,768&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=322,154 9. Check the “Smart Formatting” checkbox to enable smart formatting for dates, times, numbers, and email or web addresses. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/90358a7d-48ad-4ace-8eb9-ec03931bd9e9/ascreenshot.jpeg?tl_px=352,127&br_px=1499,768&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=327,211 10. Check the “Profanity Filter” checkbox to censor profanity in the transcript. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/92442998-f9d7-4db7-aeac-451698ed6437/ascreenshot.jpeg?tl_px=362,128&br_px=1508,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=320,283 11. Check the “Remove Hesitation Markers” checkbox to remove %HESITATION markers from the transcript. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/e8258264-f092-4fa2-b675-ea55d04b8c5b/ascreenshot.jpeg?tl_px=368,126&br_px=1515,767&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=312,342 12. Review the duration of the media file and the price of the transcription job. Click the “Create Transcription Job” button to submit the automated transcription request. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/ab2d62e6-ff0c-4bfe-b986-97816f887283/ascreenshot.jpeg?tl_px=375,128&br_px=1522,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=677,557 13. A confirmation message will be displayed on the Resource Detail page after submitting the transcription request. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/ad6ff1a0-4df7-4e68-b7b5-894c904a77bc/ascreenshot.jpeg?tl_px=1232,118&br_px=1920,503&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=233,52 Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for steps to approve pending requests.
Deepgram transcription
It’s possible to request automated speech-to-text transcription directly from Aviary using Deepgram. Deepgram supports features including: Automated index or summary: Creates an index or summary that is added to the Aviary resource Supports multiple languages: See Deepgram language support for more information Requesting an automated transcript from Deepgram: Navigate to the Resource Detail page for the resource to be transcribed.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/cda3d429-e233-49a7-bb0e-b26c70d66d95/ascreenshot.jpeg?tl_px=0,0&br_px=1376,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=508,93 If more than one media file is associated with the resource, select the media file to be transcribed in the Media File carousel. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/244ee06c-fec8-447e-8672-1480b63d8b26/ascreenshot.jpeg?tl_px=0,198&br_px=1376,968&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,371 Select the “Transcript” tab at the top of the Resource Detail page. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/e72d42dc-c124-4366-8b37-6eb83de2f5cc/ascreenshot.jpeg?tl_px=751,238&br_px=1898,879&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=605,524. Click the vertical ellipsis (⋮) in the Transcript menu. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/a595ad33-49dc-4b18-9589-ef8ef7d043c6/ascreenshot.jpeg?tl_px=931,307&br_px=1914,856&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=891,80
5. Click “Request new transcript”.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/80598ddc-cb1b-4582-8a02-44438afad110/ascreenshot.jpeg?tl_px=937,306&br_px=1920,855&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=805,123
6. Choose “Deepgram” from the list of automated transcription services.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/5c064f30-ec3a-404a-a238-4ee2cdc044ed/ascreenshot.jpeg?tl_px=544,188&br_px=1920,957&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=874,276
7. The “Request Transcript” modal window will open. Select the language of the recording that will be transcribed.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/9bc09d19-48be-4c2f-8c60-8e319f7749a6/ascreenshot.jpeg?tl_px=461,106&br_px=1444,656&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=521,127
8. Deepgram offers the option to create an automated index or automated summary of the recording. Click the appropriate radio button to create an automated index or summary.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/9aa7babc-9772-4220-b947-0083d6f45524/ascreenshot.jpeg?tl_px=406,113&br_px=1389,662&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=287,175 9. Review the duration of the media file and the price of the transcription job. Click the “Create Transcription Job” button to submit the automated transcription request. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/1f953f8b-e396-47c6-97b8-bfa5484c8298/ascreenshot.jpeg?tl_px=425,112&br_px=1408,661&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=639,457 10. A confirmation message will be displayed on the Resource Detail page after submitting the transcription request. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/ad6ff1a0-4df7-4e68-b7b5-894c904a77bc/ascreenshot.jpeg?tl_px=1232,118&br_px=1920,503&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=233,52 Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for the steps to approve pending requests.
AssemblyAI transcription
It’s possible to request automated speech-to-text transcription directly from Aviary using AssemblyAI. AssemblyAI supports features including: Speaker diarization: Differentiates between speakers in the transcript Automated index or summary: Creates an index or summary that is added to the Aviary resource Supports multiple languages: See AssemblyAI language support for more information Requesting an automated transcript from AssemblyAI: Navigate to the Resource Detail page for the resource to be transcribed.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/cda3d429-e233-49a7-bb0e-b26c70d66d95/ascreenshot.jpeg?tl_px=0,0&br_px=1376,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=508,93 If more than one media file is associated with the resource, select the media file to be transcribed in the Media File carousel. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/244ee06c-fec8-447e-8672-1480b63d8b26/ascreenshot.jpeg?tl_px=0,198&br_px=1376,968&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,371 Click the vertical ellipsis (⋮) in the Transcript menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/a595ad33-49dc-4b18-9589-ef8ef7d043c6/ascreenshot.jpeg?tl_px=931,307&br_px=1914,856&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=891,80 Click “Request new transcript” in the dropdown 
menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/80598ddc-cb1b-4582-8a02-44438afad110/ascreenshot.jpeg?tl_px=937,306&br_px=1920,855&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=805,123 6. Choose “AssemblyAI” from the list of automated transcription services. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/ec10f914-0ce0-432b-ac7c-ad8d921d8ec4/ascreenshot.jpeg?tl_px=528,291&br_px=1388,772&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=571,212 7. The “Request Transcript” modal window will open. Select the language of the recording that will be transcribed. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/3c96588f-1a48-4a7f-bdcb-fb6f03bfb299/ascreenshot.jpeg?tl_px=0,0&br_px=1146,640&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=518,134 8. Check the "Speaker Diarization" checkbox to enable speaker identification. This feature will differentiate between speakers in the transcript. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/9524d1e1-1c6c-438b-9be7-c90abd663f0e/ascreenshot.jpeg?tl_px=0,0&br_px=982,549&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=439,186 9. AssemblyAI offers the option to create an automated index or automated summary of the recording. Click the appropriate radio button to create an automated index or summary. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/00bcfc29-3b40-4460-9693-2bb599b0bb56/ascreenshot.jpeg?tl_px=0,1&br_px=982,550&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=435,243 10. Review the duration of the media file and the price of the transcription job. Click the “Create Transcription Job” button to submit the automated transcription request. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/df92aa5b-f97e-4a4c-ab33-f656f0b59df0/ascreenshot.jpeg?tl_px=365,311&br_px=1225,792&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,212 11. A confirmation message for the transcription request will be displayed on the Resource Detail page. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/9edb7c07-f4b3-491a-ad0f-19f9d67526ef/ascreenshot.jpeg?tl_px=762,7&br_px=1388,357&force_format=jpeg&q=100&width=625&wat_scale=55&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=245,58 Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for the steps to approve pending requests.
Trint transcription
Aviary offers organizations with Trint subscriptions the option to integrate their accounts and request automated transcriptions directly from Aviary using the Trint API. This option differs slightly from the other automated metadata services available in Aviary: organizations need to configure the API integration and manage additional account settings, including payments, in Trint. Trint supports features including:
Automatic speaker diarization: Differentiates between speakers in the transcript
Supports more than 40 languages: See Trint language support for more information
Paying for automated Trint transcription: Pricing for Trint automated transcription is managed through the organization’s Trint subscription, instead of in Aviary. Please refer to your organization’s Trint account to review payment information.
Configuring the Trint API integration:
1. To configure the Trint API integration, click the “Integrations” tab in the staff menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/036b7d18-e9aa-4b2f-8032-165fb9506146/ascreenshot.jpeg?tl_px=3,685&br_px=532,981&force_format=jpeg&q=100&width=529&wat_scale=47&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=72,80
2. Click “Trint” in the dropdown menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/fb67c64d-569c-4d69-b109-bde1b49672a1/ascreenshot.jpeg?tl_px=0,714&br_px=529,1010&force_format=jpeg&q=100&width=529&wat_scale=47&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=25,136
3. Check the “Enable Trint Integration?” checkbox.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/2b5f5b14-4f54-4bf9-b7af-5c13aad801eb/user_cropped_screenshot.jpeg?tl_px=384,218&br_px=957,538&force_format=jpeg&q=100&width=573&wat_scale=51&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=27,70 4. Login to Trint and generate an API key. See the screencast below for where to find the API page on the Trint website and generate your API key. 5. Once you have generated your Trint API key, copy it into the “Trint API Key” field. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/05e0db8e-1711-45a8-8fd3-9f054fa70624/user_cropped_screenshot.jpeg?tl_px=389,227&br_px=962,548&force_format=jpeg&q=100&width=573&wat_scale=51&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=74,106 6. Click the “Save” button to save the Trint API Key in Aviary. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/5f3fef4a-2e12-4e48-ae8b-47a1eed825d1/user_cropped_screenshot.jpeg?tl_px=1048,649&br_px=1813,1076&force_format=jpeg&q=100&width=764&wat_scale=68&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=634,340 7. After saving the Trint API Key, Aviary will create a Callback URL. Click the “Copy” button to copy the Callback URL. Return to the organization’s Trint settings and paste the Callback URL into the “Your Callback URL” field on the Trint API page. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/24695024-3186-4823-8830-41a7b1b84cfa/user_cropped_screenshot.jpeg?tl_px=394,212&br_px=1020,561&force_format=jpeg&q=100&width=625&wat_scale=55&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=326,266
8. Check the “Transcript Complete” and/or “Transcript Verified” checkboxes to confirm which transcription steps will be updated during the callback process. Confirm the Callback Settings and click the “Save” button to complete the Trint integration configuration.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/866ee43b-98eb-4eaf-a503-4b0697818028/File.jpeg?tl_px=1048,652&br_px=1813,1080&force_format=jpeg&q=100&width=764&wat_scale=68&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=634,337
Requesting an automated transcript from Trint:
Navigate to the Resource Detail page for the resource to be transcribed.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/cda3d429-e233-49a7-bb0e-b26c70d66d95/ascreenshot.jpeg?tl_px=0,0&br_px=1376,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=508,93
If more than one media file is associated with the resource, select the media file to be transcribed in the Media File carousel.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/244ee06c-fec8-447e-8672-1480b63d8b26/ascreenshot.jpeg?tl_px=0,198&br_px=1376,968&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,371 Click the vertical ellipsis (⋮) in the Transcript menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/a595ad33-49dc-4b18-9589-ef8ef7d043c6/ascreenshot.jpeg?tl_px=931,307&br_px=1914,856&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=891,80 Click “Request new transcript” in the dropdown menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/80598ddc-cb1b-4582-8a02-44438afad110/ascreenshot.jpeg?tl_px=937,306&br_px=1920,855&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=805,123 Choose “Trint” from the list of automated transcription services.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/e556f5a0-2847-48ec-a9f1-aa43a7a8dc43/ascreenshot.jpeg?tl_px=544,248&br_px=1920,1017&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=867,277 The “Request Transcript” modal window will open. 
Select the language of the recording that will be transcribed.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/1060882d-cef6-42d2-9659-33942aeedee6/ascreenshot.jpeg?tl_px=461,103&br_px=1444,653&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=514,133 Check the “Speaker Diarization” checkbox to enable speaker identification. This feature will differentiate between speakers in the transcript.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/2b869342-1761-4eba-906b-80dd7d7b6234/ascreenshot.jpeg?tl_px=424,105&br_px=1407,654&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=274,187 Select or create a new folder where the transcript will be saved in the organization’s Trint account.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/1236134f-f54d-4b20-870f-753e4c768ad6/ascreenshot.jpeg?tl_px=463,105&br_px=1446,654&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=492,317 Enter a folder name, if creating a new folder where the transcript will be saved in the organization’s Trint account.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/c6dd6476-0728-4f78-b019-46218d6bc37b/ascreenshot.jpeg?tl_px=349,105&br_px=1496,746&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=519,355 Review the duration of the media file. 
Click the “Create Transcription Job” button to submit the automated transcription request.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/266f1dee-fa17-43a9-8dc8-9ecbaaa3fc3d/ascreenshot.jpeg?tl_px=359,103&br_px=1506,744&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=675,508 A confirmation message will be displayed on the Resource Detail page after submitting the transcription request.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/ad6ff1a0-4df7-4e68-b7b5-894c904a77bc/ascreenshot.jpeg?tl_px=1232,118&br_px=1920,503&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=233,52 Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for the steps to approve pending requests.
Gentle transcript alignment
It’s possible to align text transcripts without timecode to media files in Aviary. Aviary has implemented the Gentle application for transcript alignment. Gentle takes text transcripts that do not have existing timecode and aligns them with audiovisual media files.
Gentle is designed specifically for aligning transcripts in English, but it is very lenient. This means that Gentle can align lower-quality audio recordings to existing transcripts. Gentle can also align highly edited transcripts, which may include aligning words in a transcript that are not spoken in the recording, or skipping words in a recording that are not included in the transcript. Gentle simply aligns all of the words it recognizes with the timecode in a media file and arranges any remaining words from the transcript between the recognized sections.
Gentle is also a good choice for legacy transcripts formatted with longer paragraphs. When aligning a legacy transcript, Gentle automatically produces a new transcript with the same paragraph formatting as the original. Gentle also creates captions for each sentence in a transcript. The captions can be displayed in the Aviary media player to improve accessibility for users viewing media files.
Gentle supports features including:
Align transcripts that do not have timecode: This service is particularly useful for highly edited transcripts that either omit spoken words or add words that are not spoken in the recording.
Maintain existing transcript formatting: spacing and paragraphs
Output transcripts and captions: for accessibility
Note: The Gentle transcript alignment service provided by Aviary runs once per day. It may take up to 24 hours for your request to be completed. This completion time varies based on when the transcript alignment is requested.
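As a rough illustration of how words that are not recognized can be arranged between the recognized sections, the following Python sketch spreads unaligned words evenly across the gap between their aligned neighbors. It is only a sketch under stated assumptions: the word tuple layout, the function name `fill_gaps`, and the even-spacing rule are hypothetical and are not taken from Gentle or Aviary.

```python
from typing import List, Optional, Tuple

# Hypothetical layout: (text, start_seconds, end_seconds).
# start/end are None when the word was not recognized in the audio.
Word = Tuple[str, Optional[float], Optional[float]]


def is_aligned(word: Word) -> bool:
    """A word counts as aligned only when both of its times are known."""
    return word[1] is not None and word[2] is not None


def fill_gaps(words: List[Word], media_duration: float) -> List[Tuple[str, float, float]]:
    """Give each unaligned word an evenly sized slot between its aligned neighbors."""
    timed: List[Tuple[str, float, float]] = []
    prev_end = 0.0  # end time of the last aligned word seen so far
    i = 0
    while i < len(words):
        if is_aligned(words[i]):
            text, start, end = words[i]
            timed.append((text, start, end))
            prev_end = end
            i += 1
            continue
        # Collect the run of consecutive unaligned words starting at i.
        j = i
        while j < len(words) and not is_aligned(words[j]):
            j += 1
        # The run ends at the next aligned word, or at the end of the media file.
        next_start = words[j][1] if j < len(words) else media_duration
        slot = (next_start - prev_end) / (j - i)
        for k in range(i, j):
            offset = k - i
            timed.append((words[k][0], prev_end + offset * slot, prev_end + (offset + 1) * slot))
        i = j
    return timed


example = [("hello", 0.0, 0.5), ("brave", None, None), ("new", None, None), ("world", 1.5, 2.0)]
aligned = fill_gaps(example, media_duration=2.0)
```

In this example, “brave” and “new” share the one-second gap between “hello” and “world”, so each receives a half-second slot.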
Requesting transcript alignment with Gentle: Navigate to the Resource Detail page for the resource to be transcribed.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/cda3d429-e233-49a7-bb0e-b26c70d66d95/ascreenshot.jpeg?tl_px=0,0&br_px=1376,769&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=508,93 If more than one media file is associated with the resource, select the media file to be transcribed in the Media File carousel. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-01-30/244ee06c-fec8-447e-8672-1480b63d8b26/ascreenshot.jpeg?tl_px=0,198&br_px=1376,968&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=402,371 Click the vertical ellipsis (⋮) in the Transcript menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/a595ad33-49dc-4b18-9589-ef8ef7d043c6/ascreenshot.jpeg?tl_px=931,307&br_px=1914,856&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=891,80 Click “Align existing transcript” in the dropdown menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/04b1ce75-8e1a-445c-bcac-3262ca94e5b7/ascreenshot.jpeg?tl_px=1058,480&br_px=3023,1579&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=889,423 Choose “Gentle” from the dropdown 
menu.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/f8586892-efee-40e9-93c6-f43869c3ab9f/ascreenshot.jpeg?tl_px=1058,497&br_px=3023,1596&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=801,462 To align a transcript that is not already in Aviary, click “Select File” and choose the transcript file to be uploaded from local storage. Note: Transcripts must be formatted as TXT files.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/43f1a4a0-0869-4dfa-aaed-7229117d1dcc/ascreenshot.jpeg?tl_px=463,276&br_px=2429,1374&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=377,123 OR click the “Select Existing Transcript” dropdown menu to select a transcript that is already in Aviary to be aligned.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/85256d31-6221-4323-b037-a7096f0e22aa/ascreenshot.jpeg?tl_px=539,315&br_px=2504,1413&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=499,293 Check the “Remove existing legacy timecode before alignment?” checkbox to remove any existing timecode from the transcript and automatically generate updated timecode during the alignment process.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/edbe69c5-8819-477d-8e96-cf3d9b8b0288/ascreenshot.jpeg?tl_px=467,289&br_px=2433,1388&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=304,382 Review the duration of the media file and the price of 
the transcript alignment job. Click the “Create Automated Transcript Alignment Job” button to submit the transcript alignment request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/898d007d-9f6d-434d-aae6-1843fa3efbe7/ascreenshot.jpeg?tl_px=468,437&br_px=2434,1536&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=611,522
A confirmation message will be displayed on the Resource Detail page after submitting the transcription request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-12/ad6ff1a0-4df7-4e68-b7b5-894c904a77bc/ascreenshot.jpeg?tl_px=1232,118&br_px=1920,503&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=233,52
Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for the steps to approve pending requests.
Requesting automated transcriptions in bulk
Automated transcription requests can be submitted for multiple Aviary resources or media files as a bulk edit process. The bulk edit process for automated transcription requests can be submitted from the Collections, Resources, or Media tables. These tables feature different ways to view assets in an Aviary organization, but the process for creating bulk automated transcription requests is similar within the tables. Requesting automated transcriptions from the Media table: 1. To request bulk automated transcription jobs from the Media table, click the “Media” tab in the staff menu. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/a28d7883-fc46-4ada-ba51-0c3ce02f78e7/ascreenshot.jpeg?tl_px=0,211&br_px=688,596&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=74,215 2. Click the checkboxes next to the media files that will be transcribed. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/8a2a21f0-04a9-47a8-8e8b-5a848db3d1df/ascreenshot.jpeg?tl_px=247,291&br_px=1107,772&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=34,386 3. Click the “Table Options” button and select “Bulk Edit Options” from the dropdown menu. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/cd0a77ea-e9ac-46f6-95ca-0b74901d4a6b/ascreenshot.jpeg?tl_px=1152,232&br_px=1916,659&force_format=jpeg&q=100&width=764&wat_scale=68&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=498,140 4. The Bulk Edit modal window will open. Select “Create Transcription Job” from the dropdown menu. 
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/89fee682-36f7-457e-89de-874e1572c7af/ascreenshot.jpeg?tl_px=605,104&br_px=1293,489&force_format=jpeg&q=100&width=688&wat_scale=61&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=359,215 5. Select the preferred automated transcription service from the “Select transcript service” dropdown menu. Tip: Learn more about each service in the Automated Metadata user guide. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/aab31a2a-4298-4829-ac41-ccdcae312e2d/ascreenshot.jpeg?tl_px=433,115&br_px=1416,665&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=365,211 6. Select the language to be transcribed in the Language dropdown menu. Complete any additional fields required by the automated transcription service in the Bulk Edit window. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/a8e07308-7ffe-463f-9965-8310c538aca6/ascreenshot.jpeg?tl_px=339,104&br_px=1486,745&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=387,228 7. Click the “Apply” button. https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/e7d541de-32df-41cf-b33c-d54bb07080ef/ascreenshot.jpeg?tl_px=355,107&br_px=1502,748&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=738,516 8. The Bulk Edit Review modal window will open. Review the media files that will be transcribed. 
Click the “Apply” button to submit the automated transcription request.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2025-02-13/506fb3b2-0e37-4c8f-b1e4-91543bbb2f40/ascreenshot.jpeg?tl_px=380,104&br_px=1526,745&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=1053,439
Note: Automated transcription is a two-step process. Any organizational user can create an automated transcription request, but organizational owners must approve the pending requests. See the Approving automated metadata requests page for the steps to approve pending requests.
Requesting automated transcriptions from the Resources table:
Requesting automated transcriptions in bulk from the Resources table is a similar process to the steps above.
Note: Submitting an automated transcription request for a resource will create separate transcription jobs for each media file associated with the resource.
1. Click the “Resources” tab in the staff menu.
2. Click the checkboxes next to the resources that will be transcribed.
3. Click the “Table Options” button and select “Bulk Edit Options” from the dropdown menu.
4. The Bulk Edit modal window will open. Select “Create Transcription Job” from the dropdown menu.
5. Select the preferred automated transcription service from the “Select transcript service” dropdown menu.
6. Select the language to be transcribed in the Language dropdown menu. Complete any additional fields required by the automated transcription service in the Bulk Edit window.
7. Click the “Apply” button.
8. The Bulk Edit Review modal window will open. Review the resources that will be transcribed. Click the “Apply” button to submit the automated transcription request.
Requesting automated transcriptions from the Collections table:
Requesting automated transcriptions in bulk from the Collections table is a similar process to the steps above.
Note: Automated transcription requests can be submitted for multiple resources OR multiple media files in a collection. See below for information about submitting bulk transcription requests for resources or media files in a collection.
1. Click the “Collections” tab in the staff menu.
2. Find the collection containing the media files that will be transcribed.
3. To submit bulk automated transcription requests for resources in the collection, click the “Manage Resources” button in the Collections table. To submit bulk automated transcription requests for media files in the collection, click the “Manage Media” button.
4. A table of resources or media files in the collection will be displayed. Click the checkboxes next to the resources or media files that will be transcribed.
5. Click the “Table Options” button and select “Bulk Edit Options” from the dropdown menu.
6. The Bulk Edit modal window will open. Select “Create Transcription Job” from the dropdown menu.
7. Select the preferred automated transcription service from the “Select transcript service” dropdown menu.
8. Select the language to be transcribed in the Language dropdown menu. Complete any additional fields required by the automated transcription service in the Bulk Edit window.
9. Click the “Apply” button.
10. The Bulk Edit Review modal window will open. Review the resources or media files that will be transcribed. Click the “Apply” button to submit the automated transcription request.
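The note that a resource-level request creates a separate transcription job for each associated media file can be pictured with a short Python sketch. Everything named here (`TranscriptionJob`, `expand_bulk_request`, the identifiers, and the “pending” status label) is hypothetical and only illustrates the fan-out; it is not Aviary’s data model or API.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class TranscriptionJob:
    """Hypothetical record for one pending transcription job."""
    resource_id: str
    media_file_id: str
    service: str
    language: str
    status: str = "pending"  # every job still waits for Organization Owner approval


def expand_bulk_request(
    media_by_resource: Dict[str, List[str]],
    selected_resources: List[str],
    service: str,
    language: str,
) -> List[TranscriptionJob]:
    """Create one job per media file for every selected resource."""
    jobs: List[TranscriptionJob] = []
    for resource_id in selected_resources:
        for media_file_id in media_by_resource.get(resource_id, []):
            jobs.append(TranscriptionJob(resource_id, media_file_id, service, language))
    return jobs


# Illustrative data: one resource with two media files, one with a single file.
media_by_resource = {"resource-1": ["media-a", "media-b"], "resource-2": ["media-c"]}
jobs = expand_bulk_request(media_by_resource, ["resource-1", "resource-2"], "Whisper", "en")
```

Selecting two resources here yields three jobs, one for each media file, which is why the review step is worth checking before clicking “Apply”.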
Approving automated metadata requests
Generating automated metadata in Aviary is a two-step process:
1. An Aviary organizational user creates the automated metadata request. Any Aviary organizational user can request an automated transcription or translation of a resource.
2. After a user has created an automated metadata request, an Organization Owner must approve the pending request to complete the process.
Approving pending automated metadata requests:
To approve pending requests as an Aviary Organization Owner, click the “Automated Metadata” tab in the staff menu.
https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/f13ebbd6-749e-459a-aeb0-9629cf37f0b2/ascreenshot.jpeg?tl_px=0,395&br_px=859,876&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=65,230
The Automated Metadata page will display a table of all automated metadata requests for the organization. Each request is organized as its own row and has additional information to help with reviewing the request.
The “View” button allows the Organization Owner to view a resource on the Resource Detail page and determine if the request should be approved.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/4e2518dc-333b-47b1-b1a4-b66206895c05/ascreenshot.jpeg?tl_px=241,149&br_px=1388,790&force_format=jpeg&q=100&width=1120.0&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=588,276 Click the “Approve” button to approve the request and start the automated process.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/ca125826-0e7f-41d6-857b-ae57e6600579/File.jpeg?tl_px=410,230&br_px=1270,711&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=665,212 The Approve Transcription modal window will be displayed. Organization Owners must click “Yes” to complete the approval process.https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/8a76b224-4824-4a2b-a5c9-b79fe6ef1e2a/File.jpeg?tl_px=277,0&br_px=1260,549&force_format=jpeg&q=100&width=983&wat_scale=87&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=525,211 After approving a request, its status on the Automated Metadata page will change to “In progress”. The time to complete an automated metadata request may vary depending on the length of the media file being transcribed or translated and the existing queue of jobs. 
Once the transcription or translation is finished, the status will be updated to “Complete.”https://ajeuwbhvhr.cloudimg.io/colony-recorder.s3.amazonaws.com/files/2024-11-25/c457a64d-3b96-40ad-a204-da9dc2c193db/File.jpeg?tl_px=410,232&br_px=1270,713&force_format=jpeg&q=100&width=860&wat_scale=76&wat=1&wat_opacity=0.7&wat_gravity=northwest&wat_url=https://colony-recorder.s3.us-west-1.amazonaws.com/images/watermarks/FB923C_standard.png&wat_pad=439,212
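The approval workflow described above can be summarized as a small state machine. The Python sketch below is illustrative only: the class and method names and the role strings are hypothetical, and the “Pending” label is an assumption based on the phrase “pending request”, while “In progress” and “Complete” are the statuses named in this guide.

```python
from enum import Enum


class Status(Enum):
    PENDING = "Pending"          # assumed label for a request awaiting approval
    IN_PROGRESS = "In progress"  # shown after an Organization Owner approves
    COMPLETE = "Complete"        # shown once the transcript or translation is finished


class MetadataRequest:
    """Hypothetical model of a single automated metadata request."""

    def __init__(self, requested_by: str) -> None:
        self.requested_by = requested_by
        self.status = Status.PENDING

    def approve(self, approver_role: str) -> None:
        # Step two of the process: only an Organization Owner may approve.
        if approver_role != "owner":
            raise PermissionError("only an Organization Owner can approve a request")
        if self.status is not Status.PENDING:
            raise ValueError("only pending requests can be approved")
        self.status = Status.IN_PROGRESS

    def mark_complete(self) -> None:
        # Called when the automated service has finished its work.
        if self.status is not Status.IN_PROGRESS:
            raise ValueError("only requests in progress can be completed")
        self.status = Status.COMPLETE


request = MetadataRequest(requested_by="staff-user")
request.approve(approver_role="owner")
request.mark_complete()
```

A request created by any organizational user stays in the first state until an owner approves it, which matches the two-step process described in this section.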

