
Commit 49e23b4

Adding links to performance benchmark page
1 parent: 3d8d878

52 files changed: +104 −0 lines changed


CUDA-Optimized/FastSpeech/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -315,6 +315,8 @@ Sample result waveforms are [FP32](fastspeech/trt/samples) and [FP16](fastspeech
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 The following section shows how to run benchmarks measuring the model performance in training and inference modes.
```

Kaldi/SpeechRecognition/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -192,6 +192,8 @@ you can set `count` to `1` in the [`instance_group` section](https://docs.nvidia
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 
 ### Metrics
 
```

MxNet/Classification/RN50v1.5/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -552,6 +552,8 @@ By default:
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 To benchmark training and inference, run:
```

PyTorch/Classification/ConvNets/efficientnet/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -492,6 +492,8 @@ Quantized models could also be used to classify new images using the `classify.p
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 The following section shows how to run benchmarks measuring the model performance in training and inference modes.
```

PyTorch/Classification/ConvNets/resnet50v1.5/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -498,6 +498,8 @@ To run inference on JPEG image using pretrained weights:
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 The following section shows how to run benchmarks measuring the model performance in training and inference modes.
```

PyTorch/Classification/ConvNets/resnext101-32x4d/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -481,6 +481,8 @@ To run inference on JPEG image using pretrained weights:
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 The following section shows how to run benchmarks measuring the model performance in training and inference modes.
```

PyTorch/Classification/ConvNets/se-resnext101-32x4d/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -483,6 +483,8 @@ To run inference on JPEG image using pretrained weights:
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Benchmarking
 
 The following section shows how to run benchmarks measuring the model performance in training and inference modes.
```

PyTorch/Classification/ConvNets/triton/resnet50/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -325,6 +325,8 @@ we can consider that all clients are local.
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 
 ### Offline scenario
 This table lists the common variable parameters for all performance measurements:
```

PyTorch/Classification/ConvNets/triton/resnext101-32x4d/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -194,6 +194,8 @@ To process static configuration logs, `triton/scripts/process_output.sh` script
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Dynamic batching performance
 The Triton Inference Server has a dynamic batching mechanism built-in that can be enabled. When it is enabled, the server creates inference batches from multiple received requests. This allows us to achieve better performance than doing inference on each single request. The single request is assumed to be a single image that needs to be inferenced. With dynamic batching enabled, the server will concatenate single image requests into an inference batch. The upper bound of the size of the inference batch is set to 64. All these parameters are configurable.
 
```
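The dynamic batching behavior described in the context above is driven by the model's Triton configuration file. As a rough, hedged sketch (the preferred batch sizes and queue delay below are illustrative values, not the settings shipped with this repository), a `config.pbtxt` enabling dynamic batching with a batch-size cap of 64 might look like:

```protobuf
# Illustrative Triton model configuration fragment (config.pbtxt).
# Only max_batch_size: 64 is stated in the README; the other values are examples.
max_batch_size: 64   # upper bound on the size of a dynamically formed batch

dynamic_batching {
  preferred_batch_size: [ 16, 32, 64 ]   # batch sizes the scheduler tries to form
  max_queue_delay_microseconds: 100      # how long a request may wait while a batch fills
}
```

Raising `max_queue_delay_microseconds` generally trades single-request latency for larger batches and higher throughput.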

PyTorch/Classification/ConvNets/triton/se-resnext101-32x4d/README.md

Lines changed: 2 additions & 0 deletions

```diff
@@ -195,6 +195,8 @@ To process static configuration logs, `triton/scripts/process_output.sh` script
 
 ## Performance
 
+The performance measurements in this document were conducted at the time of publication and may not reflect the performance achieved from NVIDIA’s latest software release. For the most up-to-date performance measurements, go to [NVIDIA Data Center Deep Learning Product Performance](https://developer.nvidia.com/deep-learning-performance-training-inference).
+
 ### Dynamic batching performance
 The Triton Inference Server has a dynamic batching mechanism built-in that can be enabled. When it is enabled, the server creates inference batches from multiple received requests. This allows us to achieve better performance than doing inference on each single request. The single request is assumed to be a single image that needs to be inferenced. With dynamic batching enabled, the server will concatenate single image requests into an inference batch. The upper bound of the size of the inference batch is set to 64. All these parameters are configurable.
 
```
