model soup2 [Paper Review] Sparse Model Soups : A Recipe For Improved Pruning Via Model Averaging 논문 원본 : https://arxiv.org/abs/2306.16788 Sparse Model Soups: A Recipe for Improved Pruning via Model AveragingNeural networks can be significantly compressed by pruning, yielding sparse models with reduced storage and computational demands while preserving predictive performance. Model soups (Wortsman et al., 2022) enhance generalization and out-of-distribution (Oarxiv.org Abstract신경망은 pruning을 .. 2024. 12. 22. [Paper review] Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time 논문 원본 : https://arxiv.org/abs/2203.05482 Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference timeThe conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation set, discarding the remainder. In this paper, we re.. 2024. 12. 22. 이전 1 다음