共 22 条
[1]
Agarwal A., 2011, P ADV NEUR INF PROC
[2]
[Anonymous], 2012, IEEE SIGNAL PROCESSI
[3]
[Anonymous], ADADELTA: An Adaptive Learning Rate Method
[4]
[Anonymous], 2013, INT C MACH LEARN
[5]
Dean J., 2012, NIPS
[6]
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[7]
LeCun Y., 2004, P COMP VIS PATT REC
[8]
Martens J., 2010, P ICML
[9]
Nesterov Y., 2007, Gradient Methods for Minimizing Composite Objective Function
[10]
Povey D., 2015, P ICLR 2015 SAN DIEG