-
Notifications
You must be signed in to change notification settings - Fork 129
SSoC' 25 blog post for week 06 by Muhammad Haroon #306
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 5 commits
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
7900823
ssoc-week06-update
haroon10725 1546ef6
update excerpt and next week's roadmap
haroon10725 136ff89
add proof of concept
haroon10725 b7bffc3
update tags
haroon10725 16fe8aa
add a youtube video
haroon10725 0a37293
update youtube video format and file and folder names
haroon10725 af8fd94
update avatar file path
haroon10725 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
1 change: 1 addition & 0 deletions
1
public/assets/Developers/Muhammad Haroon/Muisc Blocks project.html
Large diffs are not rendered by default.
Oops, something went wrong.
Binary file added
BIN
+179 KB
...ssets/Developers/Muhammad Haroon/TangoFlux - Text to Audio Generation Model.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+60.2 KB
...c/assets/Developers/Muhammad Haroon/Trimming AI generated audio in Audacity.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
78 changes: 78 additions & 0 deletions
78
src/constants/MarkdownFiles/posts/2025-07-13-ssoc-2025-MuhammadHaroon-week06.md
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,78 @@ | ||
| --- | ||
| title: "SSoC ’25 Week 06 Update by Muhammad Haroon" | ||
| excerpt: "Find another open-source model that can generate sound samples from prompts." | ||
| category: "DEVELOPER NEWS" | ||
| date: "2025-07-13" | ||
| slug: "2025-07-13-ssoc-25-MuhammadHaroon-week06" | ||
| author: "@/constants/MarkdownFiles/authors/muhammad-haroon.md" | ||
| tags: "ssoc25,midterm,sugarlabs,week06,GenAI,MusicBlocks,Music" | ||
| image: "assets/Images/GSOC.png" | ||
| --- | ||
|
|
||
| <!-- markdownlint-disable --> | ||
|
|
||
| # Week 06 Progress Report by Muhammad Haroon | ||
|
|
||
| **Project:** [Generative AI Instrument Sample Generation for Music Blocks](https://github.com/sugarlabs/GSoC/blob/master/Ideas-2025.md#Generative-AI-Instrument-Sample-Generation-for-Music-Blocks) | ||
| **Mentors:** [Walter Bender](https://github.com/walterbender), [Sumit Srivastava](https://github.com/sum2it) | ||
| **Assisting Mentors:** [Devin Ulibarri](https://github.com/pikurasa) | ||
| **Reporting Period:** 2025-07-07 - 2025-07-13 | ||
|
|
||
| --- | ||
|
|
||
| ## Progress Summary | ||
|
|
||
| The first half of my Sugar Summer of Code 2025 project has been both challenging and rewarding. My project focuses on generating sound samples from prompts. Over the past six weeks, I've made significant progress toward this goal, working closely with my mentors which is as follow: | ||
|
|
||
| - Researched state of art open-source models for generating sound samples from prompts, including [AudioGen](https://audiocraft.metademolab.com/audiogen.html) and [TangoFlux](https://huggingface.co/spaces/declare-lab/TangoFlux). | ||
|
|
||
| - Set up the models locally to enable hands on testing and integration. | ||
|
|
||
| - Tested the models using various prompts. (High quality sound of dog bark, A natural and rich sound of car horn etc). Saved the sounds samples into Google Drive which was then scored by both the mentors. | ||
|
|
||
| - Concluded that the generated sound samples cannot be used directly in Music Blocks, they need to be trimmed before integration. | ||
|
|
||
| --- | ||
|
|
||
| ## Goals for This Week | ||
|
|
||
| - **Goal 1:** Find another open-source model to generate high quality sound samples. | ||
|
|
||
| --- | ||
|
|
||
| ## This Week's Achievements | ||
|
|
||
| 1. **Find another open-source model** | ||
| - I was sucessfully able to find another open-source model [TangoFlux](https://huggingface.co/spaces/declare-lab/TangoFlux). However, after testing several prompts, I realized that these models are unable to generate sound samples suitable for use in Music Blocks. After discussing this with the mentors, we decided that an additional tool is needed, one that can clip audio between specific timestamps. This trimmed audio can then be used in Music Blocks, similar to how it's done in Audacity software. | ||
|
|
||
| --- | ||
|
|
||
| ## Next Week's Roadmap | ||
|
|
||
| - Create sketches of user interfaces showing how students will interact with the GenAI and the audio trimming tool. | ||
|
|
||
| --- | ||
|
|
||
| ## Proof of Concept (POC) | ||
|
|
||
| Since current LLMs are not capable of generating audio that can be used directly in Music Blocks, the following workflow serves as an alternative approach: | ||
|
|
||
| - The user first generates an audio using a text-to-audio model such as TangoFlux. | ||
|  | ||
|
|
||
| - The generated audio is then passed into an audio trimming tool, for demonstration purpose I am using an Audacity software. | ||
|  | ||
|
|
||
|  | ||
|
|
||
| - Click on the video below to see how can we import the audio into Music Blocks: | ||
|
|
||
| [](https://youtu.be/eR8hfvNzPTg) | ||
|
|
||
| --- | ||
|
|
||
| ## Acknowledgments | ||
|
|
||
| Thank you to my mentors, the Sugar Labs community, and fellow GSoC contributors for ongoing support. | ||
|
|
||
| --- | ||
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instead of this picture and link method of "embedding" a YT video, please use the method provided by the website.
Code:
This is documented in https://github.com/sugarlabs/www-v2/blob/main/src/constants/MarkdownFiles/posts/TESTMarkdownFormat.md
Comparison:
What it looks like now:
What it looks like using the YT feature in our MD:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.