深度学习圣经:Deep learning
本帖最后由 keer_zu 于 2024-5-17 09:34 编辑书
If we had separate parametersfor each value of the time index, we could not generalize to sequence lengths notseen during training, nor share statistical strength across different sequence lengthsand across different positions in time.
页:
[1]