OASB Skills Security Benchmark

Metadata

Precision, Recall, F1, Flag Rate, False Positive Rate (lower is better), Average Scan Time (lower is better)

Rank	Subject	F1	Model Match	Provenance	Sampled
1	NanoMind TME v0.5.0 (model only)	0.892	—	Imported	2026-05-27
2	HMA Full Pipeline (AST + NanoMind v0.5.0)	0.813	—	Imported	2026-05-27
3	HMA Static Patterns (no NanoMind)	0.675	—	Imported	2026-05-27