vibe-openscad

3D 空間認識ベンチマーク

複数の AI モデル / ハーネスに同じ課題で OpenSCAD コードを生成させ、結果を横並びで比較する。

課題
9 / 9
run
1084 (1031 ok / 53 fail)
累計コスト
$20.37 内 $0.3130 は失敗
累計時間
573.0m 内 51.1m は失敗

中央に貫通穴を持つ立方体 tier 1 tier-1-cube-with-hole

bare
anthropic
11 step 39.1s 3739/1326t $0.0518
openai
google
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

階段状ピラミッド tier 1 tier-1-stepped-pyramid

bare
anthropic
11 step 33.8s 5071/1942t $0.0739
openai
google
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

取手付きマグカップ tier 1 tier-1-mug +6 過去 run

bare
anthropic
11 step 90.0s 6128/6453t $0.1920
9 step 109.3s 6387/6418t $0.1154
openai
7 step 112.2s 5210/14846t $0.0266
7 step 852.9s 4301/37053t $0.0752
google
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

六角ボルト M8x30 tier 2 tier-2-hex-bolt

bare
anthropic
11 step 47.1s 6009/2839t $0.1010
openai
google
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

皿穴付き L 字金具 tier 2 tier-2-l-bracket

bare
anthropic
11 step 163.1s 9765/12312t $0.3566
9 step 169.6s 8636/10395t $0.1818
openai
7 step 1062.3s 6096/116744t $0.0470
google
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

取手位置を指定したマグカップ tier 2 tier-2-offset-handle-mug

bare
anthropic
11 step 142.9s 8739/10629t $0.3094
9 step 429.0s 12001/22495t $0.3734
openai
7 step 127.0s 9084/17117t $0.0110
7 step 756.7s 5879/50978t $0.1034
7 step 936.1s 3472/93357t $0.0375
5 step 268.6s 4534/33922t $0.3795
google
6 step 26.8s 5346/5906t $0.0029
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

バット蝶番(2 枚 + ピン軸) tier 3 tier-3-butt-hinge

bare
anthropic
11 step 248.2s 15189/18535t $0.5393
9 step 240.0s 13852/15375t $0.2722
openai
7 step 170.8s 8689/25967t $0.0162
7 step 1170.2s 10690/57422t $0.1175
7 step 1323.7s 9229/111303t $0.0450
7 step 397.8s 7367/53331t $0.2428
google
6 step 98.3s 21249/22522t $0.0111
openai (self-hosted)
google (self-hosted)
nvidia (self-hosted)
qwen (self-hosted)

距離センサ GP2Y0D413K0F の外形モデリング(データシート図から) tier 4 tier-4-gp2y0d413k

pdf-page
anthropic
openai
google
google (self-hosted)

距離センサ GP2Y0A21YK0F の外形モデリング(データシート図から) tier 4 tier-4-gp2y0a21yk

pdf-page
anthropic
openai
google
google (self-hosted)