Houston Home Price Prediction

COMP 341: Practical Machine Learning · Assignment 4

These course materials are private.

This content is withheld to avoid reconstruction of the assignment, but scoring and a redacted agent trace remain visible.

Rank	Model	Score	Code	Written	Review	Tests	Time	Cost
1	Claude Sonnet 4.0	100.0%	100.0%	93.5%	87.0%	3/3	8m 46s	$1.69
2	Claude Opus 4.6	80.0%	80.0%	84.8%	89.0%	11/13	3m 19s	$0.86
3	Claude Sonnet 4.6	80.0%	80.0%	91.3%	88.0%	11/13	3m 22s	$0.44
4	Claude Haiku 4.5	80.0%	80.0%	87.0%	84.0%	11/13	3m 08s	$0.50
5	GPT-5.4	80.0%	80.0%	82.6%	88.0%	11/13	3m 48s	$0.00
6	GPT-5.3 Codex	80.0%	80.0%	65.2%	90.0%	11/13	2m 15s	$0.00
7	Composer 2	80.0%	80.0%	73.9%	94.0%	11/13	2m 39s	$0.00
8	Gemini 3 Flash	80.0%	80.0%	91.3%	83.0%	11/13	5m 03s	$0.00
9	GPT-5.5 (Low)	80.0%	80.0%	29.0%	77.5%	11/13	2m 35s	$0.78
10	GPT-5.5 (Medium)	80.0%	80.0%	63.3%	77.0%	11/13	5m 38s	$1.22
11	GPT-5.5 (High)	80.0%	80.0%	50.3%	78.5%	11/13	3m 22s	$0.80
12	GPT-5.5 (X-High)	80.0%	80.0%	79.2%	91.0%	11/13	7m 40s	$1.61
13	Claude Opus 4.7	80.0%	80.0%	72.9%	86.5%	11/13	3m 28s	$1.63