Open
Changes from 5 commits
97 commits
f722f31
Fix: merged audino with cvat
ashish7515 Jul 13, 2024
fe5581a
Fix: yolo
ashish7515 Jul 14, 2024
19fdf7e
Fix: error while creating jobs
ashish7515 Jul 14, 2024
36ecba9
Fix: make migrations
ashish7515 Jul 14, 2024
dec9763
fix: dynamic email templates
ashish7515 Jul 15, 2024
3b21c34
Fix: host for ai-annotation
ashish7515 Jul 15, 2024
239e393
Update Dockerfile
rohan220217 Jul 16, 2024
f760075
Fix: update schema
ashish7515 Jul 21, 2024
7a0862d
Merge branch 'merged_cvat_audino' of https://github.com/midas-researc…
ashish7515 Jul 21, 2024
eb5de95
Fix: linter checks
ashish7515 Jul 21, 2024
dca52dd
Fix: timeout request
ashish7515 Jul 21, 2024
be83114
Fix: github test issues
ashish7515 Jul 21, 2024
e1b5c95
Fix: Quality Report parameters and add extra_params in task model
ashish7515 Jul 31, 2024
391c3dc
sorting by start column in pandas dataframe for task and job
siesto1elemento Aug 18, 2024
581efe2
Merge pull request #13 from siesto1elemento/my-feature-branch
rohan220217 Aug 20, 2024
b2eecb2
feat: audino flags added in task and job
rohan220217 Aug 20, 2024
ac403c3
Added_voxceleb, VCTK_corpus and Librivox formats for annotation exports
siesto1elemento Aug 20, 2024
72e4778
Merge pull request #14 from siesto1elemento/my-feature-branch
rohan220217 Aug 23, 2024
0b6165a
Made corrections for LibriVox
siesto1elemento Aug 23, 2024
6e12726
Initial Commit
Vignesh16879 Sep 1, 2024
f1597a8
1.1
Vignesh16879 Sep 1, 2024
255c3d0
1.2
Vignesh16879 Sep 1, 2024
669e733
1.3
Vignesh16879 Sep 1, 2024
35d512f
1.4
Vignesh16879 Sep 1, 2024
38401fe
Update views.py
Vignesh16879 Sep 4, 2024
f91702d
1.5
Vignesh16879 Sep 4, 2024
050ad75
1.6
Vignesh16879 Sep 6, 2024
fc2171e
Merge pull request #15 from siesto1elemento/my-feature-branch
rohan220217 Sep 6, 2024
6cd1a20
1.6
Vignesh16879 Sep 6, 2024
9fda5a2
1.7
Vignesh16879 Sep 6, 2024
e3bf29f
1.7
Vignesh16879 Sep 11, 2024
61f4046
notifications
Vignesh16879 Sep 11, 2024
166ba26
notifications
Vignesh16879 Sep 11, 2024
01bf22d
notifications
Vignesh16879 Sep 11, 2024
4f91723
added cloud storage for task creation and video chunk creator
Vignesh16879 Nov 11, 2024
5975ac8
Update permissions.py to original
Vignesh16879 Nov 11, 2024
edd588f
added cloud storage for task creation and video chunk creator
Vignesh16879 Nov 11, 2024
b81a78d
Merge pull request #18 from midas-research/added_notifications_to_mer…
rohan220217 Nov 22, 2024
2c6ed64
overlap added while creating task and gt
rohan220217 Nov 22, 2024
de6943e
fix: overlap attr added in job detail api
rohan220217 Nov 27, 2024
1417de0
fix: notifications api fixed and export format changed
rohan220217 Jan 20, 2025
859597a
fix: analytics issue
ashish7515 Jan 22, 2025
0f5d039
Merge pull request #20 from midas-research/feat/job-overlap
rohan220217 Feb 4, 2025
37473fb
Updated random gt segment selection
ashish7515 Feb 10, 2025
200916e
fix: version issue in requirements file
rohan220217 Feb 15, 2025
31dc72d
fix: random gt job and validation
ashish7515 Feb 27, 2025
d619712
Update AudioSegmentHandler
ashish7515 Feb 28, 2025
11ac5d2
Fix: added time stamps in export audino gt job
ashish7515 Mar 4, 2025
f56d8b5
fix: modify the segment login in gt
rohan220217 Mar 5, 2025
482da68
fix: added time_stamps in meta api
rohan220217 Mar 5, 2025
dcf1a05
feat: annotation created while creating gt
rohan220217 Mar 6, 2025
85ca11b
fix: roundoff issue and time_stamps fixed
rohan220217 Mar 7, 2025
2247d79
Fix: API Timeout error using Background task for GT job creation
ashish7515 Mar 13, 2025
8147314
Fix: Quality reports issue
ashish7515 Mar 19, 2025
032be19
Fix: add buffer of 1.5 sec for last segment of jobs to improve accuracy
ashish7515 Mar 19, 2025
d502046
Fix: min_duration for gt job segments in VAD
ashish7515 Mar 20, 2025
a803394
Fix: quality reports issue
ashish7515 Mar 20, 2025
347d227
Fix: gt job audio
ashish7515 Mar 24, 2025
f97ef26
Fix: AudioDatasetComparator issue
ashish7515 Mar 25, 2025
29686f9
Fix: worker.gt_jobs.conf isse in gt_jobs rq worker
ashish7515 Mar 27, 2025
1e341fb
Added Audino Format dataset for export and updated TaskFlags
ashish7515 Apr 7, 2025
7875668
Merge pull request #22 from midas-research/export-format-audino
rohan220217 Apr 8, 2025
5cddd86
fix: rename nationality to emotion
rohan220217 Apr 8, 2025
cf67f7b
commented the base.py code
rohan220217 Apr 8, 2025
b60bb92
chore: deploy audino_cvat_sdk
rohan220217 Apr 28, 2025
2e28726
feat: added mp3 format instead of wav
rohan220217 Feb 10, 2025
4aa296c
feat: support for mp3 format in audio processing and export
rohan220217 May 1, 2025
fd47ab1
Merge pull request #23 from midas-research/audio-format
rohan220217 May 1, 2025
098b468
Merge pull request #21 from midas-research/random-segments-gt
rohan220217 May 3, 2025
7ff1faa
Multi-Audio task and validation
ashish7515 May 18, 2025
ee49afd
Merge pull request #24 from midas-research/multi-audio-compatible-task
rohan220217 May 18, 2025
2199e05
fix: cors all origin access blocked
rohan220217 May 18, 2025
e344045
Merge pull request #25 from midas-research/audio-format
rohan220217 May 18, 2025
eb538d5
Fix: task export, data upload status, overlap
ashish7515 May 29, 2025
b7e9083
Merge pull request #26 from midas-research/multi-audio-compatible-task
rohan220217 May 29, 2025
071e61f
Fix: job export for cloud based tasks and upload progress
ashish7515 Jun 4, 2025
c478535
Fix: GT creation issue and added ai-annotation-lang field
ashish7515 Jun 13, 2025
7185407
update serializer and migrations
ashish7515 Jun 13, 2025
198ee26
Feat: New API for GT job creation status, Immediate score
ashish7515 Jun 27, 2025
1cd46b4
Fix: Update version in generate.sh and add authentication to immediat…
rohan220217 Jun 30, 2025
a35229c
Feat: Add audiowaveform peak generation and normalization, enhance au…
rohan220217 Jul 29, 2025
bc693b8
Fix: authorization while fetching audio peaks
rohan220217 Jul 29, 2025
fb9a4c2
feat: enhanced gt job processing and save annotation and fix the serv…
rohan220217 Aug 27, 2025
1b846bb
feat: improve score calculation for transcription and label matching …
rohan220217 Aug 28, 2025
6a75a57
feat: add is_create_gt_segment field to JobWriteSerializer for ground…
rohan220217 Aug 28, 2025
c6c0f62
feat: label matching task validation implemented
rohan220217 Sep 2, 2025
5caacc9
Merge pull request #27 from midas-research/feat/gt-annotation
rohan220217 Sep 2, 2025
bed5655
feat: enhance score calculation by adjusting ground truth sample poin…
rohan220217 Sep 4, 2025
74742ff
Merge pull request #28 from midas-research/feat/gt-annotation
rohan220217 Sep 4, 2025
02d16a5
feat: adjust minimum duration for audio segments and refine score cal…
rohan220217 Sep 6, 2025
79f8622
feat: implement token-based authentication for job data retrieval
rohan220217 Oct 6, 2025
2fac08d
feat: enhance score calculation for transcription tasks and improve l…
rohan220217 Oct 6, 2025
0261a4a
Fix consensus API endpoint and logic
OjasGarg06 Oct 6, 2025
f9a399e
Merge pull request #29 from midas-research/feat/gt-annotation
rohan220217 Oct 7, 2025
1d3655f
feat: implement consensus report generation to the quality report apis
rohan220217 Oct 7, 2025
1c5598f
Merge branch 'merged_cvat_audino' into consensus-api-fix
rohan220217 Oct 7, 2025
4e590de
Merge pull request #30 from midas-research/consensus-api-fix
rohan220217 Oct 7, 2025
29 changes: 15 additions & 14 deletions .vscode/launch.json
@@ -167,8 +167,9 @@
  "env": {
  "CVAT_SERVERLESS": "1",
  "ALLOWED_HOSTS": "*",
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ "IAM_OPA_BUNDLE": "1",
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "args": [
  "runserver",
@@ -210,8 +211,8 @@
  "django": true,
  "cwd": "${workspaceFolder}",
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
@@ -232,8 +233,8 @@
  "django": true,
  "cwd": "${workspaceFolder}",
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
@@ -254,8 +255,8 @@
  "django": true,
  "cwd": "${workspaceFolder}",
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
@@ -276,8 +277,8 @@
  "django": true,
  "cwd": "${workspaceFolder}",
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
@@ -295,8 +296,8 @@
  "-i", "1"
  ],
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
@@ -355,8 +356,8 @@
  "django": true,
  "cwd": "${workspaceFolder}",
  "env": {
- "DJANGO_LOG_SERVER_HOST": "localhost",
- "DJANGO_LOG_SERVER_PORT": "8282"
+ // "DJANGO_LOG_SERVER_HOST": "localhost",
+ // "DJANGO_LOG_SERVER_PORT": "8282"
  },
  "console": "internalConsole"
  },
62 changes: 62 additions & 0 deletions cvat/apps/dataset_manager/formats/aud_common_voice.py
@@ -0,0 +1,62 @@
# Copyright (C) 2020-2022 Intel Corporation
# Copyright (C) 2023-2024 CVAT.ai Corporation
#
# SPDX-License-Identifier: MIT





import os.path as osp
from glob import glob

from pyunpack import Archive

from cvat.apps.dataset_manager.bindings import (GetCVATDataExtractor,
    import_dm_annotations, match_dm_item, find_dataset_root)
from cvat.apps.dataset_manager.util import make_zip_archive
from datumaro.components.extractor import DatasetItem
from datumaro.components.project import Dataset
from datumaro.plugins.yolo_format.extractor import YoloExtractor

from .registry import dm_env, exporter, importer
from cvat.apps.engine.log import ServerLogManager
slogger = ServerLogManager(__name__)


@exporter(name='YOLO', ext='ZIP', version='1.1')
def _export(dst_file, temp_dir, instance_data, save_images=False):
    slogger.glob.info("HEYLO YOLO EXPORTER AUDINO")
    # slogger.glob.debug()
    dataset = Dataset.from_extractors(GetCVATDataExtractor(
        instance_data, include_images=save_images), env=dm_env)

    dataset.export(temp_dir, 'yolo', save_images=save_images)

    make_zip_archive(temp_dir, dst_file)

@importer(name='YOLO', ext='ZIP', version='1.1')
def _import(src_file, temp_dir, instance_data, load_data_callback=None, **kwargs):
    Archive(src_file.name).extractall(temp_dir)

    image_info = {}
    frames = [YoloExtractor.name_from_path(osp.relpath(p, temp_dir))
        for p in glob(osp.join(temp_dir, '**', '*.txt'), recursive=True)]
    root_hint = find_dataset_root(
        [DatasetItem(id=frame) for frame in frames], instance_data)
    for frame in frames:
        frame_info = None
        try:
            frame_id = match_dm_item(DatasetItem(id=frame), instance_data,
                root_hint=root_hint)
            frame_info = instance_data.frame_info[frame_id]
        except Exception: # nosec
            pass
        if frame_info is not None:
            image_info[frame] = (frame_info['height'], frame_info['width'])

    dataset = Dataset.import_from(temp_dir, 'yolo',
        env=dm_env, image_info=image_info)
    if load_data_callback is not None:
        load_data_callback(dataset, instance_data)
    import_dm_annotations(dataset, instance_data)
4 changes: 3 additions & 1 deletion cvat/apps/dataset_manager/formats/registry.py
@@ -104,7 +104,7 @@ def make_exporter(name):
  import cvat.apps.dataset_manager.formats.mot
  import cvat.apps.dataset_manager.formats.mots
  import cvat.apps.dataset_manager.formats.pascal_voc
- import cvat.apps.dataset_manager.formats.yolo
+ # import cvat.apps.dataset_manager.formats.yolo
  import cvat.apps.dataset_manager.formats.imagenet
  import cvat.apps.dataset_manager.formats.camvid
  import cvat.apps.dataset_manager.formats.widerface
@@ -117,3 +117,5 @@ def make_exporter(name):
  import cvat.apps.dataset_manager.formats.lfw
  import cvat.apps.dataset_manager.formats.cityscapes
  import cvat.apps.dataset_manager.formats.openimages
+ #Audino Export Formats
+ import cvat.apps.dataset_manager.formats.aud_common_voice
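
Review note: the new aud_common_voice.py registers its exporter and importer with the same name and version ('YOLO', '1.1') as the stock YOLO format, which is presumably why the yolo import is commented out in registry.py. The sketch below illustrates that kind of collision with a simplified, hypothetical registry. `EXPORT_FORMATS`, the body of `exporter`, and the two export functions are assumptions made for illustration and are not taken from CVAT's registry.py.

```python
# Simplified, hypothetical format registry; NOT CVAT's actual implementation.
EXPORT_FORMATS = {}


def exporter(name, ext, version):
    """Register an export function under a display name built from name and version."""
    display_name = f"{name} {version}"

    def decorator(func):
        # Refuse to silently overwrite a format that is already registered.
        if display_name in EXPORT_FORMATS:
            raise ValueError(f"Export format '{display_name}' already registered")
        func.EXT = ext
        EXPORT_FORMATS[display_name] = func
        return func

    return decorator


@exporter(name="YOLO", ext="ZIP", version="1.1")
def export_yolo(dst_file):
    # Stand-in for the stock YOLO exporter.
    return f"yolo -> {dst_file}"


def register_duplicate():
    # A second module reusing the same name and version collides at import time.
    @exporter(name="YOLO", ext="ZIP", version="1.1")
    def export_audio(dst_file):
        return f"audio -> {dst_file}"


try:
    register_duplicate()
    collided = False
except ValueError:
    collided = True
```

If the collision is the reason for disabling the stock module, giving the audio format its own name (for example one matching the file name) would likely let both formats stay registered.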