Skip to content

Commit a379f92

Browse files
20220909 restrict BDC to the SQL 2019 TOC
1 parent c081a23 commit a379f92

13 files changed

Lines changed: 156 additions & 205 deletions

.openpublishing.redirection.json

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -59455,6 +59455,11 @@
5945559455
"redirect_url": "/sql/tools/sqlpackage/troubleshooting-issues-and-performance-with-sqlpackage",
5945659456
"redirect_document_id": false
5945759457
}
59458+
{
59459+
"source_path": "docs/sql-server/partner-big-data-cluster.md",
59460+
"redirect_url": "/sql/big-data-cluster/partner-big-data-cluster",
59461+
"redirect_document_id": false
59462+
}
5945859463
]
5945959464
}
5946059465

docs/big-data-cluster/index.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -235,13 +235,13 @@ landingContent:
235235
- linkListType: reference
236236
links:
237237
- text: Integration partners
238-
url: ../sql-server/partner-big-data-cluster.md
238+
url: partner-big-data-cluster.md
239239
- text: Apache Spark
240240
url: /azure/hdinsight/spark/apache-spark-overview
241241
- text: Samples (GitHub)
242242
url: https://github.com/microsoft/sql-server-samples/tree/master/samples/features/sql-big-data-cluster
243243
- text: Solution partners
244-
url: ../sql-server/partner-big-data-cluster.md
244+
url: partner-big-data-cluster.md
245245
- linkListType: learn
246246
links:
247247
- text: Introduction to Kubernetes

docs/big-data-cluster/machine-learning-services.md

Lines changed: 136 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,30 +1,30 @@
11
---
22
title: Machine Learning Services (Python, R)
33
titleSuffix: SQL Server Big Data Clusters
4-
description: Learn how you can run Python and R scripts on the master instance of a SQL Server Big Data Clusters with Machine Learning Services.
4+
description: Learn how you can run Python and R scripts on the master instance of a SQL Server 2019 Big Data Clusters with Machine Learning Services.
55
author: WilliamDAssafMSFT
66
ms.author: wiassaf
77
ms.reviewer: wiassaf
8-
ms.date: 11/19/2020
8+
ms.date: 09/09/2022
99
ms.prod: sql
1010
ms.technology: machine-learning-bdc
1111
ms.topic: conceptual
1212
---
1313

14-
# Run Python and R scripts with Machine Learning Services on SQL Server Big Data Clusters
14+
# Run Python and R scripts with Machine Learning Services on SQL Server 2019 Big Data Clusters
1515

1616
[!INCLUDE[SQL Server 2019](../includes/applies-to-version/sqlserver2019.md)]
1717

1818
[!INCLUDE[big-data-clusters-banner-retirement](../includes/bdc-banner-retirement.md)]
1919

2020
You can run Python and R scripts on the master instance of [SQL Server Big Data Clusters](big-data-cluster-overview.md) with [Machine Learning Services](../machine-learning/index.yml).
2121

22-
> [!NOTE]
22+
> [!NOTE]
2323
> You can also run Java code on the master instance of SQL Server Big Data Clusters with the [Java Language Extension](../language-extensions/java-overview.md). Following the steps below will also enable [SQL Server Language Extensions](../language-extensions/language-extensions-overview.md).
2424
2525
## Enable Machine Learning Services
2626

27-
Machine Learning Services is installed by default on Big Data Clusters and does not require separate installation.
27+
Machine Learning Services is installed by default on SQL Server 2019 Big Data Clusters and does not require separate installation.
2828

2929
To enable Machine Learning Services, run this statement on the master instance:
3030

@@ -36,7 +36,7 @@ GO
3636

3737
You are now ready to run Python and R scripts on the master instance of Big Data Clusters. See the quickstarts under [Next steps](#next-steps) to run your first script.
3838

39-
>[!NOTE]
39+
> [!NOTE]
4040
>The configuration setting cannot be set on an availability group listener connection. If Big Data Clusters is deployed with high availability, the set `external scripts enabled` on each replica. See [Enable on cluster with high availability](#enable-on-cluster-with-high-availability).
4141
4242
## Enable on cluster with high availability
@@ -47,7 +47,7 @@ The following section describes how to enable external scripts on each instance.
4747

4848
### Create an external load balancer for each instance
4949

50-
For each replica on the availability group, create a load balancer to allow you to connect to the instance.
50+
For each replica on the availability group, create a load balancer to allow you to connect to the instance.
5151

5252
`kubectl expose pod <pod-name> --port=<connection port number> --name=<load-balancer-name> --type=LoadBalancer -n <kubernetes namespace>`
5353

@@ -61,9 +61,9 @@ The examples in this article use the following values:
6161
Update the following script for your environment, and run the commands:
6262

6363
```bash
64-
kubectl expose pod master-0 --port=1533 --name=mymaster-0 --type=LoadBalancer -n mssql-cluster
64+
kubectl expose pod master-0 --port=1533 --name=mymaster-0 --type=LoadBalancer -n mssql-cluster
6565
kubectl expose pod master-1 --port=1533 --name=mymaster-1 --type=LoadBalancer -n mssql-cluster
66-
kubectl expose pod master-2 --port=1533 --name=mymaster-2 --type=LoadBalancer -n mssql-cluster
66+
kubectl expose pod master-2 --port=1533 --name=mymaster-2 --type=LoadBalancer -n mssql-cluster
6767
```
6868

6969
`kubectl` returns the following output.
@@ -80,7 +80,7 @@ Each load balancer is a master replica endpoint.
8080

8181
1. Get the IP address for the master replica endpoint.
8282

83-
The following command returns the external IP address for the replica endpoint.
83+
The following command returns the external IP address for the replica endpoint.
8484

8585
`kubectl get services <load-balancer-name> -n <kubernetes namespace>`
8686

@@ -92,7 +92,7 @@ Each load balancer is a master replica endpoint.
9292
kubectl get services mymaster-2 -n mssql-cluster
9393
```
9494

95-
>[!NOTE]
95+
> [!NOTE]
9696
> It may take a little time before the external IP address is available. Run the preceding script periodically until each endpoint returns an external IP address.
9797
9898
1. Connect to the master replica endpoint and enable script execution.
@@ -117,7 +117,7 @@ Each load balancer is a master replica endpoint.
117117

118118
The following image demonstrates this process.
119119

120-
[![Demo](media/machine-learning-services/example-kube-enable-scripts.png "Demonstrate enable feature on Kubernetes")](media/machine-learning-services/example-kube-enable-scripts.png#lightbox)
120+
:::image type="content" source="media/machine-learning-services/example-kube-enable-scripts.png" alt-text="A screenshot of the command prompt providing a demo of the steps necessary to enable external scripts." lightbox="media/machine-learning-services/example-kube-enable-scripts.png" :::
121121

122122
You are now ready to run Python and R scripts on the master instance of Big Data Clusters. See the quickstarts under [Next steps](#next-steps) to run your first script.
123123

@@ -137,9 +137,129 @@ kubectl delete svc mymaster-1 -n mssql-cluster
137137
kubectl delete svc mymaster-2 -n mssql-cluster
138138
```
139139

140+
## SQL Server Big Data Clusters machine learning quickstarts
141+
142+
### Python quickstarts
143+
144+
- [Run Python scripts](../machine-learning/tutorials/quickstart-python-create-script.md?view=sql-server-ver15&preserve-view=true)
145+
- [Data structures and objects](../machine-learning/tutorials/quickstart-python-data-structures.md?view=sql-server-ver15&preserve-view=true)
146+
- [Python functions](../machine-learning/tutorials/quickstart-python-functions.md?view=sql-server-ver15&preserve-view=true)
147+
- [Train and score a model](../machine-learning/tutorials/quickstart-python-train-score-model.md?view=sql-server-ver15&preserve-view=true)
148+
149+
### R quickstarts
150+
151+
- [Run R scripts](../machine-learning/tutorials/quickstart-r-create-script.md?view=sql-server-ver15&preserve-view=true)
152+
- [Data types and objects](../machine-learning/tutorials/quickstart-r-data-types-and-objects.md?view=sql-server-ver15&preserve-view=true)
153+
- [R functions](../machine-learning/tutorials/quickstart-r-functions.md?view=sql-server-ver15&preserve-view=true)
154+
- [Train and score a model](../machine-learning/tutorials/quickstart-r-train-score-model.md?view=sql-server-ver15&preserve-view=true)
155+
156+
## SQL Server Big Data Clusters machine learning tutorials
157+
158+
### Python tutorial
159+
160+
#### Ski rental (linear regression)
161+
162+
- [1 - Introduction](../machine-learning/tutorials/python-ski-rental-linear-regression.md?view=sql-server-ver15&preserve-view=true)
163+
- [2 - Prepare data](../machine-learning/tutorials/python-ski-rental-linear-regression-prepare-data.md?view=sql-server-ver15&preserve-view=true)
164+
- [3 - Train model](../machine-learning/tutorials/python-ski-rental-linear-regression-train-model.md?view=sql-server-ver15&preserve-view=true)
165+
- [4 - Deploy model](../machine-learning/tutorials/python-ski-rental-linear-regression-deploy-model.md?view=sql-server-ver15&preserve-view=true)
166+
167+
#### Categorize customers (k-means clustering)
168+
169+
- [1 - Introduction](../machine-learning/tutorials/python-clustering-model.md?view=sql-server-ver15&preserve-view=true)
170+
- [2 - Prepare the data](../machine-learning/tutorials/python-clustering-model-prepare-data.md?view=sql-server-ver15&preserve-view=true)
171+
- [3 - Create the model](../machine-learning/tutorials/python-clustering-model-build.md?view=sql-server-ver15&preserve-view=true)
172+
- [4 - Deploy the model](../machine-learning/tutorials/python-clustering-model-deploy.md?view=sql-server-ver15&preserve-view=true)
173+
174+
#### NYC taxi tips (classification)
175+
176+
- [1 - Introduction](../machine-learning/tutorials/python-taxi-classification-introduction.md?view=sql-server-ver15&preserve-view=true)
177+
- [2 - Data exploration](../machine-learning/tutorials/python-taxi-classification-explore-data.md?view=sql-server-ver15&preserve-view=true)
178+
- [3 - Feature engineering](../machine-learning/tutorials/python-taxi-classification-create-features.md?view=sql-server-ver15&preserve-view=true)
179+
- [4 - Train and deploy](../machine-learning/tutorials/python-taxi-classification-train-model.md?view=sql-server-ver15&preserve-view=true)
180+
- [5 - Predictions](../machine-learning/tutorials/python-taxi-classification-deploy-model.md?view=sql-server-ver15&preserve-view=true)
181+
182+
### R tutorials
183+
184+
#### Ski rental (decision tree)
185+
186+
- [1 - Introduction](../machine-learning/tutorials/r-predictive-model-introduction.md?view=sql-server-ver15&preserve-view=true)
187+
- [2 - Prepare data](../machine-learning/tutorials/r-predictive-model-prepare-data.md?view=sql-server-ver15&preserve-view=true)
188+
- [3 - Train model](../machine-learning/tutorials/r-predictive-model-train.md?view=sql-server-ver15&preserve-view=true)
189+
- [4 - Deploy model](../machine-learning/tutorials/r-predictive-model-deploy.md?view=sql-server-ver15&preserve-view=true)
190+
191+
#### Categorize customers (k-means clustering)
192+
193+
- [1 - Introduction](../machine-learning/tutorials/r-clustering-model-introduction.md?view=sql-server-ver15&preserve-view=true)
194+
- [2 - Prepare the data](../machine-learning/tutorials/r-clustering-model-prepare-data.md?view=sql-server-ver15&preserve-view=true)
195+
- [3 - Create the model](../machine-learning/tutorials/r-clustering-model-build.md?view=sql-server-ver15&preserve-view=true)
196+
- [4 - Deploy the model](../machine-learning/tutorials/r-clustering-model-deploy.md?view=sql-server-ver15&preserve-view=true)
197+
198+
#### NYC taxi tips (classification)
199+
200+
- [1 - Introduction](../machine-learning/tutorials/r-taxi-classification-introduction.md?view=sql-server-ver15&preserve-view=true)
201+
- [2 - Data exploration](../machine-learning/tutorials/r-taxi-classification-explore-data.md?view=sql-server-ver15&preserve-view=true)
202+
- [3 - Feature engineering](../machine-learning/tutorials/r-taxi-classification-create-features.md?view=sql-server-ver15&preserve-view=true)
203+
- [4 - Train and deploy](../machine-learning/tutorials/r-taxi-classification-train-model.md?view=sql-server-ver15&preserve-view=true)
204+
- [5 - Predictions](../machine-learning/tutorials/r-taxi-classification-deploy-model.md?view=sql-server-ver15&preserve-view=true)
205+
206+
## SQL Server Big Data Clusters machine learning how-to guides
207+
208+
### Data exploration and modeling
209+
210+
- [Plot Histogram in Python](../machine-learning/data-exploration/python-plot-histogram.md?view=sql-server-ver15&preserve-view=true)
211+
- [Import data into pandas dataframe](../machine-learning/data-exploration/python-dataframe-pandas.md?view=sql-server-ver15&preserve-view=true)
212+
- [Insert dataframe into SQL](../machine-learning/data-exploration/python-dataframe-sql-server.md?view=sql-server-ver15&preserve-view=true)
213+
214+
### Data type conversions
215+
216+
- [Python to SQL](../machine-learning/python/python-libraries-and-data-types.md?view=sql-server-ver15&preserve-view=true)
217+
- [R to SQL](../machine-learning/r/r-libraries-and-data-types.md?view=sql-server-ver15&preserve-view=true)
218+
219+
### Deploy
220+
221+
- [Operationalize using stored procedures](./machine-learning/tutorials/python-ski-rental-linear-regression-deploy-model.md?preserve-view=true&view=sql-server-ver15)
222+
- [Convert R code for SQL Server](./machine-learning/deploy/modify-r-python-code-to-run-in-sql-server.md?preserve-view=true&view=sql-server-ver15)
223+
224+
### Predictions
225+
226+
- [Native scoring with PREDICT T-SQL](../machine-learning/predictions/native-scoring-predict-transact-sql.md?view=sql-server-ver15&preserve-view=true)
227+
228+
### Package management
229+
230+
#### Install new Python packages
231+
232+
- [Get Python package information](../machine-learning/package-management/python-package-information.md?view=sql-server-ver15&preserve-view=true)
233+
- [Install with sqlmlutils](../machine-learning/package-management/install-additional-python-packages-on-sql-server.md?view=sql-server-ver15&preserve-view=true)
234+
235+
#### Install new R packages
236+
237+
- [Get R package information](../machine-learning/package-management/r-package-information.md?view=sql-server-ver15&preserve-view=true)
238+
- [Install with sqlmlutils](../machine-learning/package-management/install-additional-r-packages-on-sql-server.md?view=sql-server-ver15&preserve-view=true)
239+
- [Create a miniCRAN repo](../machine-learning/package-management/create-a-local-package-repository-using-minicran.md?view=sql-server-ver15&preserve-view=true)
240+
- [Tips for using R packages](../machine-learning/package-management/tips-for-using-r-packages.md?view=sql-server-ver15&preserve-view=true)
241+
242+
### Monitore
243+
244+
- [Monitor using SSMS reports](../machine-learning/administration/monitor-sql-server-machine-learning-services-using-custom-reports-management-studio.md?view=sql-server-ver15&preserve-view=true)
245+
- [Monitor using DMVs](../machine-learning/administration/monitor-sql-server-machine-learning-services-using-dynamic-management-views.md?view=sql-server-ver15&preserve-view=true)
246+
247+
- [Monitor using extended events](../machine-learning/administration/extended-events.md?view=sql-server-ver15&preserve-view=true)
248+
- [Monitor PREDICT T-SQL](../machine-learning/administration/extended-events-predict-tsql.md?view=sql-server-ver15&preserve-view=true)
249+
250+
### Security
251+
252+
- [Give users permission](../machine-learning/security/user-permission.md?view=sql-server-ver15&preserve-view=true)
253+
254+
## Spark Machine Learning
255+
256+
- [Use Spark Machine Learning](spark-machine-learning.md)
257+
- [Data wrangling using PROSE Code Accelerator](use-prose-for-big-data-automation.md)
258+
- [Spark machine learning models with MLeap](spark-create-machine-learning-model.md)
259+
140260
## Next steps
141261

142-
+ [Run simple Python scripts](../machine-learning/tutorials/quickstart-python-create-script.md?toc=/sql/toc.json)
143-
+ [Train and score a predictive model in Python](../machine-learning/tutorials/quickstart-python-train-score-model.md?toc=/sql/toc.json)
144-
+ [Run simple R scripts](../machine-learning/tutorials/quickstart-r-create-script.md?toc=/sql/toc.json)
145-
+ [Train and score a predictive model in R](../machine-learning/tutorials/quickstart-r-train-score-model.md?toc=/sql/toc.json)
262+
- [Run simple Python scripts](../machine-learning/tutorials/quickstart-python-create-script.md?toc=/sql/toc.json)
263+
- [Train and score a predictive model in Python](../machine-learning/tutorials/quickstart-python-train-score-model.md?toc=/sql/toc.json)
264+
- [Run simple R scripts](../machine-learning/tutorials/quickstart-r-create-script.md?toc=/sql/toc.json)
265+
- [Train and score a predictive model in R](../machine-learning/tutorials/quickstart-r-train-score-model.md?toc=/sql/toc.json)

docs/sql-server/partner-big-data-cluster.md renamed to docs/big-data-cluster/partner-big-data-cluster.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.prod: sql
88
ms.technology: big-data-cluster
99
ms.prod_service: "database-engine"
1010
ms.author: wiassaf
11-
ms.reviewer: dacoelho
11+
ms.reviewer: hudequei
1212
author: WilliamDAssafMSFT
1313
---
1414
# SQL Server 2019 Big Data Clusters partners

docs/big-data-cluster/release-notes-big-data-cluster.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,14 +45,14 @@ The following table contains the tested configuration matrix for each release of
4545
Restrictions:
4646

4747
* SQL Server 2019 Big Data Clusters is supported as a *workload*. Microsoft provides support for the software components on the containers installed and configured by SQL Server 2019 Big Data Clusters only. Kubernetes itself, and other containers that may influence SQL Server 2019 Big Data Clusters behavior, are not supported by the support team. For Kubernetes support, contact your certified Kubernetes distribution provider.
48-
* SQL Server 2019 Big Data Clusters requires block storage for all persisted volumes. Management operation on top of the persisted volumes created and used by a big data cluster is a capability that depends on the storage provider including, for example, operations to expand persistent volumes (PVs). Reference your specific CSI storage provider documentation or the [partner reference architecture and white papers](../sql-server/partner-big-data-cluster.md).
48+
* SQL Server 2019 Big Data Clusters requires block storage for all persisted volumes. Management operation on top of the persisted volumes created and used by a big data cluster is a capability that depends on the storage provider including, for example, operations to expand persistent volumes (PVs). Reference your specific CSI storage provider documentation or the [partner reference architecture and white papers](partner-big-data-cluster.md).
4949
* The open-source components included by SQL Server 2019 Big Data Clusters are fixed for that particular release and must not be updated or modified.
5050
* Container images are provided "as-is". Composability features of Kubernetes aren't supported. Changing the set of container images in a SQL Server 2019 Big Data Cluster release, or to customize the containers, is not supported.
5151

5252
Reference architecture and white papers for [!INCLUDE[big-data-clusters-nover](../includes/ssbigdataclusters-ss-nover.md)] can be found on the following pages:
5353

5454
* [SQL Server 2019](https://www.microsoft.com/sql-server/sql-server-2019)
55-
* [SQL Server 2019 Big Data Clusters partners](../sql-server/partner-big-data-cluster.md)
55+
* [SQL Server 2019 Big Data Clusters partners](partner-big-data-cluster.md)
5656

5757
## Release history
5858

docs/big-data-cluster/release-notes-cumulative-update-12.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ SQL Server Big Data Clusters CU12 was tested on the following environment combin
2828
Reference Architecture White Papers for SQL Server Big Data Clusters can be found on the following pages:
2929

3030
* https://www.microsoft.com/sql-server/sql-server-2019
31-
* [SQL Server Big Data Clusters partners](../sql-server/partner-big-data-cluster.md)
31+
* [SQL Server Big Data Clusters partners](partner-big-data-cluster.md)
3232

3333
## System environment
3434

docs/big-data-cluster/release-notes-cumulative-update-13.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ For detailed SQL Server engine changes, check the [official SQL Server CU13 know
4848
Reference Architecture White Papers for [!INCLUDE[big-data-clusters-nover](../includes/ssbigdataclusters-ss-nover.md)] can be found on the following pages:
4949

5050
* [SQL Server 2019](https://www.microsoft.com/sql-server/sql-server-2019)
51-
* [SQL Server Big Data Clusters partners](../sql-server/partner-big-data-cluster.md)
51+
* [SQL Server Big Data Clusters partners](partner-big-data-cluster.md)
5252

5353

5454
## System environment

docs/big-data-cluster/release-notes-cumulative-update-14.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ For detailed SQL Server engine changes, check the [official SQL Server CU14 know
4343
Reference Architecture White Papers for [!INCLUDE[big-data-clusters-nover](../includes/ssbigdataclusters-ss-nover.md)] can be found on the following pages:
4444

4545
* [SQL Server 2019](https://www.microsoft.com/sql-server/sql-server-2019)
46-
* [SQL Server Big Data Clusters partners](../sql-server/partner-big-data-cluster.md)
46+
* [SQL Server Big Data Clusters partners](partner-big-data-cluster.md)
4747

4848

4949
## System environment

docs/big-data-cluster/release-notes-cumulative-update-15.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ For detailed SQL Server engine changes, check the [official SQL Server CU15 know
4343
Reference Architecture White Papers for [!INCLUDE[big-data-clusters-nover](../includes/ssbigdataclusters-ss-nover.md)] can be found on the following pages:
4444

4545
* [SQL Server 2019](https://www.microsoft.com/sql-server/sql-server-2019)
46-
* [SQL Server Big Data Clusters partners](../sql-server/partner-big-data-cluster.md)
46+
* [SQL Server Big Data Clusters partners](partner-big-data-cluster.md)
4747

4848
## System environment
4949

0 commit comments

Comments
 (0)