自动将代码升级到 TensorFlow 2

#@title Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

自动将代码升级到 TensorFlow 2#

View on TensorFlow.org

Run in Google Colab

View source on GitHub

Download notebook

TensorFlow 2.0 包含许多 API 变更，例如重新排序了参数，重命名了符号，更改了参数的默认值。手动执行所有这些修改可能很乏味，而且很容易出错。为了简化更改，尽可能地让您无缝过渡到 TF 2.0，TensorFlow 团队创建了 tf_upgrade_v2 实用工具，帮助您将旧版代码转换至新的 API。

注：TensorFlow 1.13 和更高版本（包括所有 TF 2.0 版本）会自动安装 tf_upgrade_v2。

典型的用法如下：

tf_upgrade_v2 \
  --intree my_project/ \
  --outtree my_project_v2/ \
  --reportfile report.txt

将现有 TensorFlow 1.x Python 脚本转换为 TensorFlow 2.0 脚本可以加快升级流程。

转换脚本会尽可能实现自动化处理，但仍有一些语法和样式变更无法通过脚本执行转换。

兼容性模块#

某些 API 符号无法通过简单的字符串替换进行升级。为了确保代码在 TensorFlow 2.0 中仍受支持，升级脚本包含了一个 compat.v1 模块。该模块可将 TF 1.x 符号（如 tf.foo）替换为等效的 tf.compat.v1.foo 引用。虽然该兼容性模块效果不错，但我们仍建议人工校对替换，并尽快将代码迁移到 tf.* 命名空间（而不是 tf.compat.v1 命名空间）中的新 API。

由于 TensorFlow 2.x 模块弃用（例如，tf.flags 和 tf.contrib），切换到 compat.v1 无法解决某些更改。升级此代码可能需要其他库（例如，absl.flags）或切换到 tensorflow/addons 中的软件包。

使用升级脚本#

设置#

开始之前，请确保已安装 TensorlFlow 2.0。

import tensorflow as tf

print(tf.__version__)

克隆 tensorflow/models git 仓库，以便获得一些要测试的代码：

!git clone --branch r1.13.0 --depth 1 https://github.com/tensorflow/models

读取帮助#

脚本应当随 TensorFlow 安装。下面是内置帮助命令：

!tf_upgrade_v2 -h

TF1 代码示例#

下面是一个简单的 TensorFlow 1.0 脚本示例：

!head -n 65 models/samples/cookbook/regression/custom_regression.py | tail -n 10

对于安装的 TensorFlow 2.0，它不会运行：

!(cd models/samples/cookbook/regression &amp;&amp; python custom_regression.py)

单个文件#

升级脚本可以在单个 Python 文件上运行：

!tf_upgrade_v2 \
  --infile models/samples/cookbook/regression/custom_regression.py \
  --outfile /tmp/custom_regression_v2.py

如果无法找到解决代码问题的方法，该脚本会打印错误消息。

目录树#

典型项目（包括下面的简单示例）会使用远不止一个文件。通常需要升级整个软件包，所以该脚本也可以在目录树上运行：

# upgrade the .py files and copy all the other files to the outtree
!tf_upgrade_v2 \
    --intree models/samples/cookbook/regression/ \
    --outtree regression_v2/ \
    --reportfile tree_report.txt

注意关于 dataset.make_one_shot_iterator 函数的一条警告。

现在，对于 TensorFlow 2.0，该脚本已经可以发挥作用：

请注意，凭借 tf.compat.v1 模块，转换的脚本在 TensorFlow 1.14 中也可以运行。

!(cd regression_v2 &amp;&amp; python custom_regression.py 2&gt;&amp;1) | tail

详细报告#

该脚本还会报告一个详细更改列表。在本例中，它发现了一个可能不安全的转换，因此在文件顶部包含了一条警告：

!head -n 20 tree_report.txt

再次注意关于 Dataset.make_one_shot_iterator 函数的一条警告。

在其他情况下，对于非常用更改，输出会解释原因：

%%writefile dropout.py
import tensorflow as tf

d = tf.nn.dropout(tf.range(10), 0.2)
z = tf.zeros_like(d, optimize=False)

!tf_upgrade_v2 \
  --infile dropout.py \
  --outfile dropout_v2.py \
  --reportfile dropout_report.txt &gt; /dev/null

!cat dropout_report.txt

以下是经过修改的文件内容，请注意脚本如何通过添加参数名来处理移动和重命名的参数：

!cat dropout_v2.py

更大的项目可能会包含一些错误，例如转换 DeepLab 模型：

!tf_upgrade_v2 \
    --intree models/research/deeplab \
    --outtree deeplab_v2 \
    --reportfile deeplab_report.txt &gt; /dev/null

它会生成输出文件：

!ls deeplab_v2

但是其中包含错误。该报告会帮助您找到确保代码可以正常运行所需要解决的错误。下面是前三个错误：

!cat deeplab_report.txt | grep -i models/research/deeplab | grep -i error | head -n 3

“安全”模式#

该转换脚本还有一种介入度相对较低的 SAFETY 模式。在此模式下，只需更改导入来使用 tensorflow.compat.v1 模块：

!cat dropout.py

!tf_upgrade_v2 --mode SAFETY --infile dropout.py --outfile dropout_v2_safe.py &gt; /dev/null

!cat dropout_v2_safe.py

如您所见，这不会升级代码，但允许 TensorFlow 1 代码在 TensorFlow 2 中运行

注意事项#

在运行此脚本之前，不要手动更新代码的某些部分。尤其是更改了参数顺序的函数（如 tf.argmax 或 tf.batch_to_space），否则会导致代码无法正确添加与现有代码匹配的关键字参数。
该脚本假定使用 import tensorflow as tf 导入 tensorflow。
该脚本不会更改参数顺序，但是会将关键字参数添加到本身已更改参数顺序的函数。
请查阅 tf2up.ml，找到一款方便的工具来升级 GitHub 仓库中的 Jupyter 笔记本和 Python 文件。

要报告升级脚本错误或提出功能请求，请在 GitHub 上提交问题。如果您在测试 TensorFlow 2.0，我们非常希望了解您的反馈意见！请加入 TF 2.0 测试社区，将您的问题和讨论发送到 testing@tensorflow.org。