I’m building a free bilingual machine-learning notebook course — looking for feedback on structure and coverage [R]
Our take
The burgeoning landscape of accessible machine learning education just received a significant boost with the creation of a free, bilingual machine-learning notebook course, spearheaded by user /u/abolfazl1363. This initiative, housed on GitHub, offers a practical, notebook-first curriculum designed for local execution and step-by-step study. The dual-language approach, encompassing both English and Persian/Farsi, is particularly noteworthy, addressing a critical need for inclusive learning resources. It’s a welcome complement to discussions, like those surrounding PhD internship applications [I’d Like to Try for a Google PhD Internship [R]], highlighting the ongoing demand for robust educational pathways within the field. Furthermore, the increasing presence of quantitative firms at events such as ICML 2026 [Quant firms at ICML 2026 [D]] underscores the growing professional interest and, implicitly, the need for a skilled workforce—a need initiatives like this directly address. The commitment to providing hands-on practice, complete with datasets and exercises, is particularly commendable and distinguishes it from more theoretical approaches.
The thoughtful inclusion of a wide range of topics—from fundamental ML concepts and data preprocessing to advanced areas like time series analysis, anomaly detection, and responsible ML—demonstrates a well-structured approach to curriculum design. The developer’s conscious effort to solicit feedback on chapter order, missing topics, and the utility of bilingual notebooks is a hallmark of a genuinely community-driven project. Avoiding the pitfalls of mere “copy/paste code” by emphasizing practical application is also a crucial design choice. This resonates deeply with the broader challenge of ensuring effective learning in a field often characterized by complex jargon and rapid technological advancements, a challenge also seen in discussions around poster presentations [ICML Poster [D]], where clarity and conciseness are paramount. The project’s open-source nature encourages collaboration and iterative improvement, making it a valuable asset for both learners and educators.
The significance of this project extends beyond its immediate educational value. It exemplifies the growing trend of democratizing access to AI knowledge, breaking down barriers for non-native English speakers and individuals who may lack access to traditional educational institutions. By offering a practical, hands-on learning experience, it empowers individuals to develop tangible skills and contribute to the rapidly evolving field of machine learning. The choice of Jupyter Notebooks as the delivery format is strategic, allowing learners to directly interact with the code, experiment with different parameters, and visualize the results – a key element in fostering genuine understanding. This focus on practical application directly addresses a common critique of purely theoretical AI education.
Looking ahead, it will be interesting to observe the trajectory of this project and the impact of the bilingual component. Will it inspire similar initiatives targeting other languages and underserved communities? The success of this venture hinges not only on the quality of the materials but also on the level of engagement and contribution from the broader community. Ultimately, the widespread adoption and adaptation of this resource could significantly contribute to a more diverse and inclusive AI workforce, answering the implicit question of how to best equip the next generation of data scientists with the tools and knowledge they need to thrive in an increasingly data-driven world.
Hi everyone,
I’m building an open-source machine-learning tutorial repository in Jupyter Notebook format:
https://github.com/mohammadijoo/Machine_Learning_Tutorials
The course is bilingual: English and Persian/Farsi versions are organized in parallel. The goal is to make a practical, notebook-first ML curriculum that students can run locally and study step by step.
Current focus areas include:
- ML foundations and workflow
- data cleaning, preprocessing, feature engineering
- regression and classification
- tree models and ensembles
- clustering and dimensionality reduction
- evaluation, cross-validation, calibration
- time series, anomaly detection, responsible ML, and MLOps concepts
- datasets and exercises for hands-on practice
I would appreciate feedback on:
- whether the chapter order makes sense for beginners
- what important classical ML topics are missing
- whether bilingual notebooks are useful for non-native English learners
- how to make the notebooks more practical without turning them into only “copy/paste code”
I’m sharing this as a free educational resource and would value constructive criticism.
[link] [comments]
Read on the original site
Open the publisher's page for the full experience