Develops and evaluates datasets for training LLMs. Curates code, refines AI-generated code, and develops performance benchmarks.