You need to create an inferencing cluster.

Deploy the real-time endpoint

After your AKS service has finished provisioning, return to the real-time inferencing pipeline to complete deployment.

* Select Deploy above the canvas.
* Select Deploy new real-time endpoint.
* Select the AKS cluster you created.
* Select Deploy.

Reference: https://docs.microsoft.com/en-us/azure/machine-learning/tutorial-designer-automobile-price-deploy
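
The prerequisite for the designer steps above, creating the inferencing cluster and waiting for it to finish provisioning, can also be scripted. The following is a minimal sketch, assuming the Azure Machine Learning Python SDK v1 (`azureml-core`), which the answer itself does not mention. The cluster name `aks-inference` and the helper `get_or_create_aks_cluster` are hypothetical, and the default provisioning settings are used only for illustration.

```python
"""Sketch: get or create an AKS inference cluster (assumes azureml-core, SDK v1)."""

CLUSTER_NAME = "aks-inference"  # hypothetical cluster name, not from the source


def get_or_create_aks_cluster(workspace, name=CLUSTER_NAME):
    """Return the AKS compute target called `name`, provisioning it if absent."""
    # Imported lazily so this module loads even where azureml-core is not installed.
    from azureml.core.compute import AksCompute, ComputeTarget
    from azureml.core.compute_target import ComputeTargetException

    try:
        # Reuse the cluster if the workspace already has one with this name.
        return ComputeTarget(workspace=workspace, name=name)
    except ComputeTargetException:
        # Assumption: default provisioning settings are acceptable here.
        config = AksCompute.provisioning_configuration()
        cluster = ComputeTarget.create(workspace, name, config)
        # Block until provisioning finishes; the endpoint cannot be
        # deployed to the cluster before this completes.
        cluster.wait_for_completion(show_output=True)
        return cluster
```

Assuming a workspace object loaded with `Workspace.from_config()`, calling `get_or_create_aks_cluster(ws)` should return only after provisioning has completed, at which point the cluster should be available to pick in the designer's deployment dialog.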