Fix task hints from strides to dilations #154

Open
wants to merge 1 commit into base: 2024
6 changes: 3 additions & 3 deletions week03_lm/homework_pytorch.ipynb
@@ -192,8 +192,8 @@
" Note: fixed window LM is effectively performing a convolution over a sequence of words.\n",
" This convolution only looks on current and previous words.\n",
" Such convolution can be represented as a sequence of 2 operations:\n",
" - pad input vectors by {strides * (filter_size - 1)} zero vectors on the \"left\", do not pad right\n",
" - perform regular convolution with {filter_size} and {strides}\n",
" - pad input vectors by {dilation * (filter_size - 1)} zero vectors on the \"left\", do not pad right\n",
" - perform regular convolution with {filter_size} and {dilations}\n",
" \n",
" - If you're absolutely lost, here's a hint: use nn.ZeroPad2d((NUM_LEADING_ZEROS, 0, 0, 0))\n",
" followed by a nn.Conv1d(..., padding=0). And yes, its okay that padding is technically \"2d\".\n",
@@ -789,7 +789,7 @@
"So you've learned the building blocks of neural language models, you can now build the ultimate monster: \n",
"* Make it char-level, word level or maybe use sub-word units like [bpe](https://github.com/rsennrich/subword-nmt);\n",
"* Combine convolutions, recurrent cells, pre-trained embeddings and all the black magic deep learning has to offer;\n",
" * Use strides to get larger window size quickly. Here's a [scheme](https://storage.googleapis.com/deepmind-live-cms/documents/BlogPost-Fig2-Anim-160908-r01.gif) from google wavenet.\n",
" * Use dilations to get larger window size quickly. Here's a [scheme](https://storage.googleapis.com/deepmind-live-cms/documents/BlogPost-Fig2-Anim-160908-r01.gif) from google wavenet.\n",
"* Train on large data. Like... really large. Try [1 Billion Words](http://www.statmt.org/lm-benchmark/1-billion-word-language-modeling-benchmark-r13output.tar.gz) benchmark;\n",
"* Use training schedules to speed up training. Start with small length and increase over time; Take a look at [one cycle](https://medium.com/@nachiket.tanksale/finding-good-learning-rate-and-the-one-cycle-policy-7159fe1db5d6) for learning rate;\n",
"\n",
6 changes: 3 additions & 3 deletions week03_lm/homework_tf.ipynb
@@ -193,8 +193,8 @@
" Note: fixed window LM is effectively performing a convolution over a sequence of words.\n",
" This convolution only looks on current and previous words.\n",
" Such convolution can be represented as a sequence of 2 operations:\n",
" - pad input vectors by {strides * (filter_size - 1)} zero vectors on the \"left\", do not pad right\n",
" - perform regular convolution with {filter_size} and {strides}\n",
" - pad input vectors by dilation * (filter_size - 1)} zero vectors on the \"left\", do not pad right\n",
" - perform regular convolution with {filter_size} and {dilations}\n",
" - If you're absolutely lost, here's a hint: use ZeroPadding1D and Conv1D from keras.layers\n",
" You can stack several convolutions at once\n",
" \"\"\"\n",
@@ -831,7 +831,7 @@
"So you've learned the building blocks of neural language models, you can now build the ultimate monster: \n",
"* Make it char-level, word level or maybe use sub-word units like [bpe](https://github.com/rsennrich/subword-nmt);\n",
"* Combine convolutions, recurrent cells, pre-trained embeddings and all the black magic deep learning has to offer;\n",
" * Use strides to get larger window size quickly. Here's a [scheme](https://storage.googleapis.com/deepmind-live-cms/documents/BlogPost-Fig2-Anim-160908-r01.gif) from google wavenet.\n",
" * Use dilations to get larger window size quickly. Here's a [scheme](https://storage.googleapis.com/deepmind-live-cms/documents/BlogPost-Fig2-Anim-160908-r01.gif) from google wavenet.\n",
"* Train on large data. Like... really large. Try [1 Billion Words](http://www.statmt.org/lm-benchmark/1-billion-word-language-modeling-benchmark-r13output.tar.gz) benchmark;\n",
"* Use training schedules to speed up training. Start with small length and increase over time; Take a look at [one cycle](https://medium.com/@nachiket.tanksale/finding-good-learning-rate-and-the-one-cycle-policy-7159fe1db5d6) for learning rate;\n",
"\n",