IS 733: Collaborative Data Mining and Knowledge Discovery Project
Welcome to the Collaborative Data Mining and Knowledge Discovery repository, part of the IS 733 curriculum at UMBC for Spring 2025. This project serves as a hands-on, collaborative space where students apply theoretical concepts in data mining to practical problems, using tools like Python, Jupyter Notebooks, and WEKA. Beyond its immediate academic use, this repository is designed to grow and serve as a resource for future students, fostering innovation, collaboration, and a deeper understanding of data mining.
This project aims to:
- Provide students with real-world experience in applying data mining techniques such as classification, clustering, and association rule mining.
- Encourage teamwork and co-location by fostering collaboration among diverse teams.
- Build a shared knowledge base that will be continuously enhanced by future students.
- Highlight the importance of ethical considerations in data mining and promote responsible data practices.
- Prepare students for research and practical challenges in the rapidly evolving field of data science.
This repository is more than just a class project; it is a foundation for future learning and collaboration. Future students are encouraged to:
- Build upon the existing work, contributing innovative ideas and solutions.
- Leverage the repository as a resource for exploring advanced data mining topics and techniques.
- Collaborate with peers to tackle challenging data problems and publish findings.
- Use this space as a stepping stone for academic and professional endeavors in data science.
The repository is organized as follows:
- /docs: Course materials, lecture slides, and supporting documentation.
- /code: Scripts, notebooks, and implementations of data mining algorithms.
- /data: Datasets for practice and project work.
- /projects: Contributions from student groups, including reports, presentations, and posters.
- /resources: Links to textbooks, research papers, and online tools for data mining.
- Programming Languages: Python (essential), R (optional for specific tasks)
- Platforms: Jupyter Notebook for coding and visualization.
- Data Mining Software: WEKA, an open-source data mining toolkit.
- Collaboration: GitHub for version control and team collaboration.
This project would not be possible without the hardwork of all of the students who were in class with of Dr. Basnyat and the support of our Teaching Assistant, **. Special thanks to UMBC’s Department of Information Systems for fostering an environment that encourages innovation and collaboration.
- Clone the repository:
git clone https://github.com/mlteach/is7332025/-link.git