YOLO Dataset Utilities

Small, focused scripts for preparing YOLO detection/pose datasets: annotation generation, label cleanup/merging, mosaics, and image preprocessing.

Conventions

Current Working Directory (CWD): where your images/ and labels/ live.
Script Directory: where the scripts, their .md docs, and .pt models live.
Most tools read/write relative to CWD unless a path is provided.
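
The two-directory convention above can be sketched as follows. This is a minimal illustration, not code taken from the scripts: the helper names script_dir, resolve_data_dir, and resolve_model are hypothetical, and the assumption is that data folders default to the CWD while models are looked up next to the script.

```python
from pathlib import Path
from typing import Optional


def script_dir(script_file: str) -> Path:
    """Directory holding the scripts, their .md docs, and .pt models."""
    return Path(script_file).resolve().parent


def resolve_data_dir(name: str, override: Optional[str] = None) -> Path:
    """Return the explicit path if one was given, else <CWD>/<name>."""
    if override:
        return Path(override).expanduser().resolve()
    return Path.cwd() / name


def resolve_model(model_name: str, script_file: str) -> Path:
    """Look up a model file next to the script (hypothetical helper)."""
    return script_dir(script_file) / model_name
```

Inside a script, resolve_data_dir("images") would then point at the images/ folder of the directory you launched from, while resolve_model("model.pt", __file__) would point into the Script Directory (the model name here is a placeholder).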

Tools

annotate_images.py: Run a YOLO pose model on images and write pose labels, with optional flipped inference.
cleanup_labels.py: Replace spaces with underscores in images/ and labels/, move orphan labels to labels-x, and create empty labels for unlabeled images.
correct_keypoints.py: Normalize person keypoint visibility flags and zero coordinates when invisible.
correct_face_keypoints.py: Merge improved face keypoints into pose labels with weighted blending and deviation stats.
add_face_keypoints.py: Add face keypoints from face labels into base pose labels using closest face-area matching (or nose matching with --nose).
correct_mpii_keypoints.py: Replace selected COCO keypoints with MPII pose keypoints using bbox-based matching.
predict_pose_optical_flow.py: Predict pose keypoints for a target indexed image/range by tracking from previous labeled frames with pyramidal Lucas-Kanade optical flow, reverse back-check, and optional multi-frame fusion to reduce drift.
crop_portrait_square_yolo.py: Face-centered square crops of portraits using YOLO pose keypoints, with optional resize/rotate/ratio/flip/debug.
crop_detected_objects_yolo.py: Crop images to keep all YOLO-detected boxes and visible keypoints with configurable pixel boundary, then rewrite labels.
crop_to_annotations_yolo.py: Crop images and YOLO labels around all boxes and keypoints with padded aspect ratio selection and prefixed outputs.
download_google_images.py: Download full-resolution images from Google or Yandex Images search URLs, save as JPEGs, and optionally resize.
download_videos_yt_dlp.py: Download TikTok/YouTube videos from urls.txt (or a single CLI URL) in best available quality using yt-dlp.
extend_flip_yolo.py: Extend images with a flipped duplicate and update bounding boxes/keypoints.
extract_video_frames.py: Extract frames from all videos in CWD into per-video folders, with optional frame skipping.
delete_similar_frames.py: Delete near-duplicate frames by comparing each image to the previous one.
extract_tfrecord_images.py: Extract JPEG images from TFRecord files with progress and stats.
merge_datasets.py: Merge multiple datasets into a unified train/val layout and write content.md counts.
merge_pose_results.py: Merge body and face pose labels, refining face points.
mosaic_self_yolo.py: Build self-mosaics and rewrite YOLO detection/pose labels.
mosaic_yolo.py: Build multi-image mosaics with optional flip/rotate and merged labels.
optimize_dataset_tiles_yolo.py: Tile images by size/aspect into mosaics, rewrite YOLO labels, and copy/rescale remaining images.
resize_images.py: Resize images and convert formats to JPEG by default with progress and stats.
sam3.py: Run SAM3 text prompts on one image and export YOLO bbox labels.
rename_images_labels.py: Rename images with matching labels using a pattern and update label filenames.
rotate_head_tilt_yolo.py: Rotate portraits based on head tilt and update pose labels.
rotate_images_labels.py: Rotate images to fixed angles and update labels, supporting YOLO detection and pose.
yolo_pose_to_coco_json.py: Convert YOLO11-pose labels into COCO JSON files for train/val splits.
visualize-pose.py: Overlay YOLO pose keypoints and boxes onto images for quick inspection.
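
As an illustration of the kind of work these tools do, the three steps listed for cleanup_labels.py can be sketched as below. This is a rough sketch under stated assumptions, not the script's actual implementation: the function name cleanup, the IMAGE_EXTS set, and the returned statistics are all hypothetical, and the real script may handle name collisions and file types differently.

```python
from pathlib import Path

# Assumed set of image extensions; the real script may accept others.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}


def cleanup(root: Path) -> dict:
    """Tidy root/images and root/labels; return counts of what changed."""
    images = root / "images"
    labels = root / "labels"
    orphans = root / "labels-x"
    labels.mkdir(exist_ok=True)
    stats = {"renamed": 0, "orphans": 0, "created": 0}

    # 1. Replace spaces with underscores in file names of both folders.
    for folder in (images, labels):
        for path in sorted(folder.iterdir()):
            if path.is_file() and " " in path.name:
                target = path.with_name(path.name.replace(" ", "_"))
                if not target.exists():  # never overwrite an existing file
                    path.rename(target)
                    stats["renamed"] += 1

    image_stems = {
        p.stem for p in images.iterdir() if p.suffix.lower() in IMAGE_EXTS
    }

    # 2. Move labels that have no matching image into labels-x.
    for label in sorted(labels.glob("*.txt")):
        if label.stem not in image_stems:
            orphans.mkdir(exist_ok=True)
            label.rename(orphans / label.name)
            stats["orphans"] += 1

    # 3. Create an empty label file for every image that lacks one.
    for stem in sorted(image_stems):
        label = labels / f"{stem}.txt"
        if not label.exists():
            label.touch()
            stats["created"] += 1

    return stats
```

Calling cleanup(Path.cwd()) from a dataset folder would apply the sketch to the images/ and labels/ folders there. An empty label file tells a YOLO trainer that the image contains no objects, which is why unlabeled images get one rather than being skipped.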

Docs

Each script listed under Tools has a matching .md file in the Script Directory with full usage and arguments.
