|
| 1 | +--- |
| 2 | +title: "SSoC ’25 Week 06 Update by Muhammad Haroon" |
| 3 | +excerpt: "Find another open-source model that can generate sound samples from prompts." |
| 4 | +category: "DEVELOPER NEWS" |
| 5 | +date: "2025-07-13" |
| 6 | +slug: "2025-07-13-ssoc-25-MuhammadHaroon-week06" |
| 7 | +author: "@/constants/MarkdownFiles/authors/muhammad-haroon.md" |
| 8 | +tags: "ssoc25,midterm,sugarlabs,week06,GenAI,MusicBlocks,Music" |
| 9 | +image: "assets/Images/GSOC.png" |
| 10 | +--- |
| 11 | + |
| 12 | +<!-- markdownlint-disable --> |
| 13 | + |
| 14 | +# Week 06 Progress Report by Muhammad Haroon |
| 15 | + |
| 16 | +**Project:** [Generative AI Instrument Sample Generation for Music Blocks](https://github.com/sugarlabs/GSoC/blob/master/Ideas-2025.md#Generative-AI-Instrument-Sample-Generation-for-Music-Blocks) |
| 17 | +**Mentors:** [Walter Bender](https://github.com/walterbender), [Sumit Srivastava](https://github.com/sum2it) |
| 18 | +**Assisting Mentors:** [Devin Ulibarri](https://github.com/pikurasa) |
| 19 | +**Reporting Period:** 2025-07-07 - 2025-07-13 |
| 20 | + |
| 21 | +--- |
| 22 | + |
| 23 | +## Progress Summary |
| 24 | + |
| 25 | +The first half of my Sugar Summer of Code 2025 project has been both challenging and rewarding. My project focuses on generating sound samples from prompts. Over the past six weeks, I've made significant progress toward this goal, working closely with my mentors which is as follow: |
| 26 | + |
| 27 | +- Researched state of art open-source models for generating sound samples from prompts, including [AudioGen](https://audiocraft.metademolab.com/audiogen.html) and [TangoFlux](https://huggingface.co/spaces/declare-lab/TangoFlux). |
| 28 | + |
| 29 | +- Set up the models locally to enable hands on testing and integration. |
| 30 | + |
| 31 | +- Tested the models using various prompts. (High quality sound of dog bark, A natural and rich sound of car horn etc). Saved the sounds samples into Google Drive which was then scored by both the mentors. |
| 32 | + |
| 33 | +- Concluded that the generated sound samples cannot be used directly in Music Blocks, they need to be trimmed before integration. |
| 34 | + |
| 35 | +--- |
| 36 | + |
| 37 | +## Goals for This Week |
| 38 | + |
| 39 | +- **Goal 1:** Find another open-source model to generate high quality sound samples. |
| 40 | + |
| 41 | +--- |
| 42 | + |
| 43 | +## This Week's Achievements |
| 44 | + |
| 45 | +1. **Find another open-source model** |
| 46 | + - I was sucessfully able to find another open-source model [TangoFlux](https://huggingface.co/spaces/declare-lab/TangoFlux). However, after testing several prompts, I realized that these models are unable to generate sound samples suitable for use in Music Blocks. After discussing this with the mentors, we decided that an additional tool is needed, one that can clip audio between specific timestamps. This trimmed audio can then be used in Music Blocks, similar to how it's done in Audacity software. |
| 47 | + |
| 48 | +--- |
| 49 | + |
| 50 | +## Next Week's Roadmap |
| 51 | + |
| 52 | +- Create sketches of user interfaces showing how students will interact with the GenAI and the audio trimming tool. |
| 53 | + |
| 54 | +--- |
| 55 | + |
| 56 | +## Proof of Concept (POC) |
| 57 | + |
| 58 | +Since current LLMs are not capable of generating audio that can be used directly in Music Blocks, the following workflow serves as an alternative approach: |
| 59 | + |
| 60 | +- The user first generates an audio using a text-to-audio model such as TangoFlux. |
| 61 | + |
| 62 | + |
| 63 | +- The generated audio is then passed into an audio trimming tool, for demonstration purpose I am using an Audacity software. |
| 64 | + |
| 65 | + |
| 66 | + |
| 67 | + |
| 68 | +- Click on the video below to see how can we import the audio into Music Blocks: |
| 69 | + |
| 70 | +[youtube: eR8hfvNzPTg] |
| 71 | + |
| 72 | +--- |
| 73 | + |
| 74 | +## Acknowledgments |
| 75 | + |
| 76 | +Thank you to my mentors, the Sugar Labs community, and fellow GSoC contributors for ongoing support. |
| 77 | + |
| 78 | +--- |
0 commit comments