You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
323 lines
9.6 KiB
323 lines
9.6 KiB
{
|
|
"cells": [
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"# Data Science u oblaku: Način \"Azure ML SDK\"\n",
|
|
"\n",
|
|
"## Uvod\n",
|
|
"\n",
|
|
"U ovom ćemo bilježniku naučiti kako koristiti Azure ML SDK za treniranje, implementaciju i korištenje modela putem Azure ML-a.\n",
|
|
"\n",
|
|
"Preduvjeti:\n",
|
|
"1. Kreirali ste Azure ML radni prostor.\n",
|
|
"2. Učitali ste [Heart Failure dataset](https://www.kaggle.com/andrewmvd/heart-failure-clinical-data) u Azure ML.\n",
|
|
"3. Prenijeli ste ovaj bilježnik u Azure ML Studio.\n",
|
|
"\n",
|
|
"Sljedeći koraci su:\n",
|
|
"\n",
|
|
"1. Kreirajte Eksperiment u postojećem Radnom prostoru.\n",
|
|
"2. Kreirajte Compute klaster.\n",
|
|
"3. Učitajte skup podataka.\n",
|
|
"4. Konfigurirajte AutoML koristeći AutoMLConfig.\n",
|
|
"5. Pokrenite AutoML eksperiment.\n",
|
|
"6. Istražite rezultate i pronađite najbolji model.\n",
|
|
"7. Registrirajte najbolji model.\n",
|
|
"8. Implementirajte najbolji model.\n",
|
|
"9. Koristite krajnju točku.\n",
|
|
"\n",
|
|
"## Specifični uvozi za Azure Machine Learning SDK\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"from azureml.core import Workspace, Experiment\n",
|
|
"from azureml.core.compute import AmlCompute\n",
|
|
"from azureml.train.automl import AutoMLConfig\n",
|
|
"from azureml.widgets import RunDetails\n",
|
|
"from azureml.core.model import InferenceConfig, Model\n",
|
|
"from azureml.core.webservice import AciWebservice"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Inicijalizacija radnog prostora\n",
|
|
"Inicijalizirajte objekt radnog prostora iz spremljene konfiguracije. Provjerite je li konfiguracijska datoteka prisutna na .\\config.json\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"ws = Workspace.from_config()\n",
|
|
"print(ws.name, ws.resource_group, ws.location, ws.subscription_id, sep = '\\n')"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Kreirajte Azure ML eksperiment\n",
|
|
"\n",
|
|
"Kreirajmo eksperiment pod nazivom 'aml-experiment' u radnom prostoru koji smo upravo inicijalizirali.\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"experiment_name = 'aml-experiment'\n",
|
|
"experiment = Experiment(ws, experiment_name)\n",
|
|
"experiment"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Kreirajte klaster za računalnu obradu\n",
|
|
"Morat ćete kreirati [cilj za računalnu obradu](https://docs.microsoft.com/azure/machine-learning/concept-azure-machine-learning-architecture#compute-target) za vaše AutoML pokretanje.\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"aml_name = \"heart-f-cluster\"\n",
|
|
"try:\n",
|
|
" aml_compute = AmlCompute(ws, aml_name)\n",
|
|
" print('Found existing AML compute context.')\n",
|
|
"except:\n",
|
|
" print('Creating new AML compute context.')\n",
|
|
" aml_config = AmlCompute.provisioning_configuration(vm_size = \"Standard_D2_v2\", min_nodes=1, max_nodes=3)\n",
|
|
" aml_compute = AmlCompute.create(ws, name = aml_name, provisioning_configuration = aml_config)\n",
|
|
" aml_compute.wait_for_completion(show_output = True)\n",
|
|
"\n",
|
|
"cts = ws.compute_targets\n",
|
|
"compute_target = cts[aml_name]"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Podaci\n",
|
|
"Provjerite jeste li učitali skup podataka u Azure ML i da je ključ istog naziva kao i skup podataka.\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"key = 'heart-failure-records'\n",
|
|
"dataset = ws.datasets[key]\n",
|
|
"df = dataset.to_pandas_dataframe()\n",
|
|
"df.describe()"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Konfiguracija AutoML\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"automl_settings = {\n",
|
|
" \"experiment_timeout_minutes\": 20,\n",
|
|
" \"max_concurrent_iterations\": 3,\n",
|
|
" \"primary_metric\" : 'AUC_weighted'\n",
|
|
"}\n",
|
|
"\n",
|
|
"automl_config = AutoMLConfig(compute_target=compute_target,\n",
|
|
" task = \"classification\",\n",
|
|
" training_data=dataset,\n",
|
|
" label_column_name=\"DEATH_EVENT\",\n",
|
|
" enable_early_stopping= True,\n",
|
|
" featurization= 'auto',\n",
|
|
" debug_log = \"automl_errors.log\",\n",
|
|
" **automl_settings\n",
|
|
" )"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## AutoML Pokretanje\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"remote_run = experiment.submit(automl_config)"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"RunDetails(remote_run).show()"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"best_run, fitted_model = remote_run.get_output()"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"best_run.get_properties()"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"model_name = best_run.properties['model_name']\n",
|
|
"script_file_name = 'inference/score.py'\n",
|
|
"best_run.download_file('outputs/scoring_file_v_1_0_0.py', 'inference/score.py')\n",
|
|
"description = \"aml heart failure project sdk\"\n",
|
|
"model = best_run.register_model(model_name = model_name,\n",
|
|
" description = description,\n",
|
|
" tags = None)"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Implementirajte najbolji model\n",
|
|
"\n",
|
|
"Pokrenite sljedeći kod za implementaciju najboljeg modela. Status implementacije možete vidjeti u Azure ML portalu. Ovaj korak može potrajati nekoliko minuta.\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"inference_config = InferenceConfig(entry_script=script_file_name, environment=best_run.get_environment())\n",
|
|
"\n",
|
|
"aciconfig = AciWebservice.deploy_configuration(cpu_cores = 1,\n",
|
|
" memory_gb = 1,\n",
|
|
" tags = {'type': \"automl-heart-failure-prediction\"},\n",
|
|
" description = 'Sample service for AutoML Heart Failure Prediction')\n",
|
|
"\n",
|
|
"aci_service_name = 'automl-hf-sdk'\n",
|
|
"aci_service = Model.deploy(ws, aci_service_name, [model], inference_config, aciconfig)\n",
|
|
"aci_service.wait_for_deployment(True)\n",
|
|
"print(aci_service.state)"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"source": [
|
|
"## Korištenje krajnje točke\n",
|
|
"Možete dodati ulaze u sljedeći primjer ulaza.\n"
|
|
],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"data = {\n",
|
|
" \"data\":\n",
|
|
" [\n",
|
|
" {\n",
|
|
" 'age': \"60\",\n",
|
|
" 'anaemia': \"false\",\n",
|
|
" 'creatinine_phosphokinase': \"500\",\n",
|
|
" 'diabetes': \"false\",\n",
|
|
" 'ejection_fraction': \"38\",\n",
|
|
" 'high_blood_pressure': \"false\",\n",
|
|
" 'platelets': \"260000\",\n",
|
|
" 'serum_creatinine': \"1.40\",\n",
|
|
" 'serum_sodium': \"137\",\n",
|
|
" 'sex': \"false\",\n",
|
|
" 'smoking': \"false\",\n",
|
|
" 'time': \"130\",\n",
|
|
" },\n",
|
|
" ],\n",
|
|
"}\n",
|
|
"\n",
|
|
"test_sample = str.encode(json.dumps(data))"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"source": [
|
|
"response = aci_service.run(input_data=test_sample)\n",
|
|
"response"
|
|
],
|
|
"outputs": [],
|
|
"metadata": {}
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"metadata": {},
|
|
"source": [
|
|
"\n---\n\n**Odricanje od odgovornosti**: \nOvaj dokument je preveden pomoću AI usluge za prevođenje [Co-op Translator](https://github.com/Azure/co-op-translator). Iako nastojimo osigurati točnost, imajte na umu da automatski prijevodi mogu sadržavati pogreške ili netočnosti. Izvorni dokument na izvornom jeziku treba smatrati autoritativnim izvorom. Za ključne informacije preporučuje se profesionalni prijevod od strane čovjeka. Ne preuzimamo odgovornost za bilo kakva nesporazuma ili pogrešna tumačenja koja proizlaze iz korištenja ovog prijevoda.\n"
|
|
]
|
|
}
|
|
],
|
|
"metadata": {
|
|
"orig_nbformat": 4,
|
|
"language_info": {
|
|
"name": "python"
|
|
},
|
|
"coopTranslator": {
|
|
"original_hash": "af42669556d5dc19fc4cc3866f7d2597",
|
|
"translation_date": "2025-09-01T20:07:28+00:00",
|
|
"source_file": "5-Data-Science-In-Cloud/19-Azure/notebook.ipynb",
|
|
"language_code": "hr"
|
|
}
|
|
},
|
|
"nbformat": 4,
|
|
"nbformat_minor": 2
|
|
} |