I use model parallel mode and set the parameter model-parallel-size=4, it will save four models. how do I use the four saved models to inference?