This quasi-experimental study investigated generative AI (GenAI) tools—Copilot for chemistry and GitHub Copilot for mathematics—on academic achievement and sustainable professional development among 160 undergraduates (40 experimental/control per department) at the University of Baghdad’s Ibn Al-Haitham College of Education for Pure Sciences (2024–2025). Non-random assignment controlled for covariates. Pre/post validated tests (α ≥ .85; 15 MCQ + 5 essay items) measured outcomes. ANOVA revealed significant gains for experimental groups (p < .001, η2 = .41, Cohen’s d = 0.72 [95% CI: 0.45–0.98]). Chemistry excelled in affective domains; mathematics in cognitive/skills. Findings affirm GenAI’s domain-specific efficacy, providing datasets for AI-STEM pedagogy.