AI Co-Pilot Evaluation Framework