Importing local audio processing
If your raw data is audio, then follow the instructions in this document for processing.
Last updated
If your raw data is audio, then follow the instructions in this document for processing.
Last updated
Select 'From Audio' and 'New Task' in the 'Import Data Source' section on the dataset details page.
All imports are called tasks. In a task, you can add multiple similar pieces of data for processing. In this section, multiple audio files can be added to the local audio task.
On the task creation page, enter a name for your task (up to 20 characters). This name will help you quickly find and manage the task in the task list.
Click on the upload area, drag the local files you want to import into the upload box, or click the upload button to select files for uploading.
Supported file formats include: .mp3
、.WAV
.
You can upload up to 50 files per task, with each file not exceeding 200MB.
Make sure that the multiple files uploaded in one task are similar in content, so the parameters and output processing can be applied correctly.
The task settings are similar to those for importing tasks from a webpage, including field configuration and content extraction.
Choose the appropriate parsing method based on the file type to ensure the system can properly process the uploaded files.
Default Field Types:
Timeline: The system will attempt to extract timeline information from the audio content.
Text Details: The system will attempt to extract dialogue text from the audio content.
Text Language: The system will attempt to extract dialogue language information from the audio content.
Custom Fields:
If you need to classify specific extracted data into certain fields, you can click "+ Add Field" and add the field name and description.
For example, if there is a nickname in the audio that needs to be extracted, the field name key would be: nickname, and the field description: user nickname.
Please add in English only. The more detailed the field description, the more accurate the extraction will be.
Once you have configured the fetch parameters, you will need to set up output settings to determine how the extracted data will be saved and exported.
Output Format Settings
You can choose to save the retrieved data in either JSON or Markdown format. JSON format is more suitable for subsequent API program calls, while Markdown format is better suited for knowledge base data processing.
Save and manually execute the task later:
If you wish to configure the task first without initiating the scraping process immediately, you can click on the "Save and manually execute the task later" button. The task will be saved in the task list for manual initiation at a later time.
Execute the task immediately:
If you are prepared to instantly scrape webpage data, click the "Execute the task immediately" button. The system will commence data scraping and import it into the specified dataset.