tensorflow
diff --git a/‎docs/api_docs/python/_toc.yaml‎
Lines changed: 24 additions & 0 deletions b/‎docs/api_docs/python/_toc.yaml‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎docs/api_docs/python/index.md‎
Lines changed: 12 additions & 1 deletion b/‎docs/api_docs/python/index.md‎
Lines changed: 12 additions & 1 deletion
diff --git a/‎docs/api_docs/python/tft.md‎
Lines changed: 2 additions & 0 deletions b/‎docs/api_docs/python/tft.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎docs/api_docs/python/tft/MeanAndVarCombiner.md‎
Lines changed: 13 additions & 0 deletions b/‎docs/api_docs/python/tft/MeanAndVarCombiner.md‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎docs/api_docs/python/tft/apply_buckets.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/api_docs/python/tft/apply_buckets.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/api_docs/python/tft/apply_buckets_with_interpolation.md‎
Lines changed: 36 additions & 0 deletions b/‎docs/api_docs/python/tft/apply_buckets_with_interpolation.md‎
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/api_docs/python/tft/apply_vocabulary.md‎
Lines changed: 3 additions & 3 deletions b/‎docs/api_docs/python/tft/apply_vocabulary.md‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎docs/api_docs/python/tft/compute_and_apply_vocabulary.md‎
Lines changed: 6 additions & 1 deletion b/‎docs/api_docs/python/tft/compute_and_apply_vocabulary.md‎
Lines changed: 6 additions & 1 deletion
diff --git a/‎docs/api_docs/python/tft/sparse_tensor_to_dense_with_shape.md‎
Lines changed: 4 additions & 1 deletion b/‎docs/api_docs/python/tft/sparse_tensor_to_dense_with_shape.md‎
Lines changed: 4 additions & 1 deletion
diff --git a/‎docs/api_docs/python/tft/vocabulary.md‎
Lines changed: 12 additions & 1 deletion b/‎docs/api_docs/python/tft/vocabulary.md‎
Lines changed: 12 additions & 1 deletion
@@ -8,6 +8,8 @@ toc:
       path: /tfx/transform/api_docs/python/tft/apply_analyzer
     - title: apply_buckets
       path: /tfx/transform/api_docs/python/tft/apply_buckets
+    - title: apply_buckets_with_interpolation
+      path: /tfx/transform/api_docs/python/tft/apply_buckets_with_interpolation
     - title: apply_function
       path: /tfx/transform/api_docs/python/tft/apply_function
     - title: apply_function_with_checkpoint
@@ -112,3 +114,25 @@ toc:
       path: /tfx/transform/api_docs/python/tft_beam/WriteMetadata
     - title: WriteTransformFn
       path: /tfx/transform/api_docs/python/tft_beam/WriteTransformFn
+  - title: tft_beam.analyzer_cache
+    section:
+    - title: Overview
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache
+    - title: make_cache_entry_key
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache/make_cache_entry_key
+    - title: make_dataset_key
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache/make_dataset_key
+    - title: ReadAnalysisCacheFromFS
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache/ReadAnalysisCacheFromFS
+    - title: validate_dataset_keys
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache/validate_dataset_keys
+    - title: WriteAnalysisCacheToFS
+      path: /tfx/transform/api_docs/python/tft_beam/analyzer_cache/WriteAnalysisCacheToFS
+  - title: tft_beam.info_theory
+    section:
+    - title: Overview
+      path: /tfx/transform/api_docs/python/tft_beam/info_theory
+    - title: calculate_partial_expected_mutual_information
+      path: /tfx/transform/api_docs/python/tft_beam/info_theory/calculate_partial_expected_mutual_information
+    - title: calculate_partial_mutual_information
+      path: /tfx/transform/api_docs/python/tft_beam/info_theory/calculate_partial_mutual_information
@@ -9,6 +9,7 @@
 *  <a href="./tft/TFTransformOutput.md"><code>tft.TFTransformOutput</code></a>
 *  <a href="./tft/apply_analyzer.md"><code>tft.apply_analyzer</code></a>
 *  <a href="./tft/apply_buckets.md"><code>tft.apply_buckets</code></a>
+*  <a href="./tft/apply_buckets_with_interpolation.md"><code>tft.apply_buckets_with_interpolation</code></a>
 *  <a href="./tft/apply_function.md"><code>tft.apply_function</code></a>
 *  <a href="./tft/apply_function_with_checkpoint.md"><code>tft.apply_function_with_checkpoint</code></a>
 *  <a href="./tft/apply_pyfunc.md"><code>tft.apply_pyfunc</code></a>
@@ -52,4 +53,14 @@
 *  <a href="./tft_beam/ReadTransformFn.md"><code>tft_beam.ReadTransformFn</code></a>
 *  <a href="./tft_beam/TransformDataset.md"><code>tft_beam.TransformDataset</code></a>
 *  <a href="./tft_beam/WriteMetadata.md"><code>tft_beam.WriteMetadata</code></a>
-*  <a href="./tft_beam/WriteTransformFn.md"><code>tft_beam.WriteTransformFn</code></a>
+*  <a href="./tft_beam/WriteTransformFn.md"><code>tft_beam.WriteTransformFn</code></a>
+*  <a href="./tft_beam/analyzer_cache.md"><code>tft_beam.analyzer_cache</code></a>
+*  <a href="./tft_beam/analyzer_cache/ReadAnalysisCacheFromFS.md"><code>tft_beam.analyzer_cache.ReadAnalysisCacheFromFS</code></a>
+*  <a href="./tft_beam/analyzer_cache/WriteAnalysisCacheToFS.md"><code>tft_beam.analyzer_cache.WriteAnalysisCacheToFS</code></a>
+*  <a href="./tft_beam/analyzer_cache/make_cache_entry_key.md"><code>tft_beam.analyzer_cache.make_cache_entry_key</code></a>
+*  <a href="./tft_beam/analyzer_cache/make_dataset_key.md"><code>tft_beam.analyzer_cache.make_dataset_key</code></a>
+*  <a href="./tft_beam/analyzer_cache/validate_dataset_keys.md"><code>tft_beam.analyzer_cache.validate_dataset_keys</code></a>
+*  <a href="./tft_beam/info_theory.md"><code>tft_beam.info_theory</code></a>
+*  <a href="./tft_beam/info_theory/calculate_partial_expected_mutual_information.md"><code>tft_beam.info_theory.calculate_partial_expected_mutual_information</code></a>
+*  <a href="./tft_beam/info_theory/calculate_partial_mutual_information.md"><code>tft_beam.info_theory.calculate_partial_mutual_information</code></a>
+*  <a href="./tft_beam/info_theory/math.md"><code>tft_beam.info_theory.math</code></a>
@@ -33,6 +33,8 @@ Init module for TF.Transform.
 
 [`apply_buckets(...)`](./tft/apply_buckets.md): Returns a bucketized column, with a bucket index assigned to each input.
 
+[`apply_buckets_with_interpolation(...)`](./tft/apply_buckets_with_interpolation.md): Interpolates within the provided buckets and then normalizes to 0 to 1.
+
 [`apply_function(...)`](./tft/apply_function.md): Deprecated function, equivalent to fn(*args). (deprecated)
 
 [`apply_function_with_checkpoint(...)`](./tft/apply_function_with_checkpoint.md): Applies a tensor-in-tensor-out function with variables to some `Tensor`s.
 
@@ -4,6 +4,7 @@
 <meta itemprop="property" content="accumulator_coder"/>
 <meta itemprop="property" content="__init__"/>
 <meta itemprop="property" content="add_input"/>
+<meta itemprop="property" content="compute_running_update"/>
 <meta itemprop="property" content="create_accumulator"/>
 <meta itemprop="property" content="extract_output"/>
 <meta itemprop="property" content="merge_accumulators"/>
@@ -62,6 +63,18 @@ Composes an accumulator from batch_values and calls merge_accumulators.
 
 A `_MeanAndVarAccumulator` which is accumulator and batch_values combined.
 
+<h3 id="compute_running_update"><code>compute_running_update</code></h3>
+
+``` python
+compute_running_update(
+    total_count,
+    current_count,
+    update
+)
+```
+
+Numerically stable way of computing a streaming batched update.
+
 <h3 id="create_accumulator"><code>create_accumulator</code></h3>
 
 ``` python
 
@@ -20,7 +20,7 @@ Returns a bucketized column, with a bucket index assigned to each input.
 * <b>`x`</b>: A numeric input `Tensor` or `SparseTensor` whose values should be mapped
       to buckets.  For `SparseTensor`s, the non-missing values will be mapped
       to buckets and missing value left missing.
-* <b>`bucket_boundaries`</b>: The bucket boundaries represented as a rank 1 `Tensor`.
+* <b>`bucket_boundaries`</b>: The bucket boundaries represented as a rank 2 `Tensor`.
 * <b>`name`</b>: (Optional) A name for this operation.
 
 
 
@@ -0,0 +1,36 @@
+<div itemscope itemtype="http://developers.google.com/ReferenceObject">
+<meta itemprop="name" content="tft.apply_buckets_with_interpolation" />
+<meta itemprop="path" content="Stable" />
+</div>
+
+# tft.apply_buckets_with_interpolation
+
+``` python
+tft.apply_buckets_with_interpolation(
+    x,
+    bucket_boundaries,
+    name=None
+)
+```
+
+Interpolates within the provided buckets and then normalizes to 0 to 1.
+
+A method for normalizing continuous numeric data to the range [0, 1].
+Numeric values are first bucketized according to the provided boundaries, then
+linearly interpolated within their respective bucket ranges. Finally, the
+interpolated values are normalized to the range [0, 1]. Values that are
+less than or equal to the lowest boundary, or greater than or equal to the
+highest boundary, will be mapped to 0 and 1 respectively.
+
+#### Args:
+
+* <b>`x`</b>: A numeric input `Tensor` (tf.float32, tf.float64, tf.int32, tf.int64).
+* <b>`bucket_boundaries`</b>: Sorted bucket boundaries as a rank-2 `Tensor`.
+* <b>`name`</b>: (Optional) A name for this operation.
+
+
+#### Returns:
+
+A `Tensor` of the same shape as `x`, normalized to the range [0, 1]. If the
+  input x is tf.float64, the returned values will be tf.float64.
+  Otherwise, returned values are tf.float32.
@@ -28,9 +28,9 @@ files. This behavior will likely be fixed/improved in the future.
 
 #### Args:
 
-* <b>`x`</b>: A `Tensor` or `SparseTensor` of type tf.string to which the vocabulary
-    transformation should be applied.
-    The column names are those intended for the transformed tensors.
+* <b>`x`</b>: A categorical `Tensor` or `SparseTensor` of type tf.string or
+    tf.int[8|16|32|64] to which the vocabulary transformation should be
+    applied. The column names are those intended for the transformed tensors.
 * <b>`deferred_vocab_filename_tensor`</b>: The deferred vocab filename tensor as
     returned by <a href="../tft/vocabulary.md"><code>tft.vocabulary</code></a>.
 * <b>`default_value`</b>: The value to use for out-of-vocabulary values, unless
 
@@ -20,6 +20,7 @@ tft.compute_and_apply_vocabulary(
     coverage_top_k=None,
     coverage_frequency_threshold=None,
     key_fn=None,
+    fingerprint_shuffle=False,
     name=None
 )
 ```
@@ -37,7 +38,7 @@ operation.
 
 #### Args:
 
-* <b>`x`</b>: A `Tensor` or `SparseTensor` of type tf.string.
+* <b>`x`</b>: A `Tensor` or `SparseTensor` of type tf.string or tf.int[8|16|32|64].
 * <b>`default_value`</b>: The value to use for out-of-vocabulary values, unless
     'num_oov_buckets' is greater than zero.
 * <b>`top_k`</b>: Limit the generated vocabulary to the first `top_k` elements. If set
@@ -73,6 +74,10 @@ operation.
 * <b>`key_fn`</b>: (Optional), (Experimental) A fn that takes in a single entry of `x`
     and returns the corresponding key for coverage calculation. If this is
     `None`, no coverage arm is added to the vocabulary.
+* <b>`fingerprint_shuffle`</b>: (Optional), (Experimental) Whether to sort the
+    vocabularies by fingerprint instead of counts. This is useful for load
+    balancing on the training parameter servers. Shuffle only happens while
+    writing the files, so all the filters above will still take effect.
 * <b>`name`</b>: (Optional) A name for this operation.
 
 
 
@@ -8,7 +8,8 @@
 ``` python
 tft.sparse_tensor_to_dense_with_shape(
     x,
-    shape
+    shape,
+    default_value=0
 )
 ```
 
@@ -18,6 +19,8 @@ Converts a `SparseTensor` into a dense tensor and sets its shape.
 
 * <b>`x`</b>: A `SparseTensor`.
 * <b>`shape`</b>: The desired shape of the densified `Tensor`.
+* <b>`default_value`</b>: (Optional) Value to set for indices not specified. Defaults
+    to zero.
 
 
 #### Returns:
 
@@ -19,6 +19,7 @@ tft.vocabulary(
     coverage_top_k=None,
     coverage_frequency_threshold=None,
     key_fn=None,
+    fingerprint_shuffle=False,
     name=None
 )
 ```
@@ -33,6 +34,10 @@ In case one of the tokens contains the '\n' or '\r' characters or is empty it
 will be discarded since we are currently writing the vocabularies as text
 files. This behavior will likely be fixed/improved in the future.
 
+If an integer `Tensor` is provided, its semantic type should be categorical
+not a continuous/numeric, since computing a vocabulary over a continuous
+feature is not appropriate.
+
 The unique values are sorted by decreasing frequency and then reverse
 lexicographical order (e.g. [('a', 5), ('c', 3), ('b', 3)]).
 
@@ -64,7 +69,8 @@ within each vocabulary entry (b/117796748).
 
 #### Args:
 
-* <b>`x`</b>: An input `Tensor` or `SparseTensor` with dtype tf.string.
+* <b>`x`</b>: A categorical/discrete input `Tensor` or `SparseTensor` with dtype
+    tf.string or tf.int[8|16|32|64].
 * <b>`top_k`</b>: Limit the generated vocabulary to the first `top_k` elements. If set
     to None, the full vocabulary is generated.
 * <b>`frequency_threshold`</b>: Limit the generated vocabulary only to elements whose
@@ -98,6 +104,11 @@ within each vocabulary entry (b/117796748).
 * <b>`key_fn`</b>: (Optional), (Experimental) A fn that takes in a single entry of `x`
     and returns the corresponding key for coverage calculation. If this is
     `None`, no coverage arm is added to the vocabulary.
+* <b>`fingerprint_shuffle`</b>: (Optional), (Experimental) Whether to sort the
+    vocabularies by fingerprint instead of counts. This is useful for load
+    balancing on the training parameter servers. Shuffle only happens while
+    writing the files, so all the filters above (top_k, frequency_threshold,
+    etc) will still take effect.
 * <b>`name`</b>: (Optional) A name for this operation.