Skip to content

Commit 55c5f99

Browse files
authored
Merge pull request NVIDIA#286 from eric-haibin-lin/patch-1
Fix a typo
2 parents 0571072 + 1ae1592 commit 55c5f99

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

TensorFlow/LanguageModeling/BERT/optimization.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -356,7 +356,7 @@ def apply_gradients(self, grads_and_vars, global_step=None, name=None,
356356
# the correct way of using L2 regularization/weight decay with Adam,
357357
# since that will interact with the m and v parameters in strange ways.
358358
#
359-
# Instead we want ot decay the weights in a manner that doesn't interact
359+
# Instead we want to decay the weights in a manner that doesn't interact
360360
# with the m/v parameters. This is equivalent to adding the square
361361
# of the weights to the loss with plain (non-momentum) SGD.
362362
if self._do_use_weight_decay(param_name):

0 commit comments

Comments
 (0)