Clean up inference config: remove training-only flags, set bd_size=32 default, dtype=bfloat16
cb43b83 (verified), WuChengyue committed on May 6