A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.