What is synthetic data?

A crash course in synthetic data

 

 


What is synthetic data?

The answer is relatively simple. Original data is collected through your interactions with real people (for example clients, patients and employees) and through all of your internal processes. Synthetic data, by contrast, is generated by a computer algorithm that produces completely new, artificial datapoints.

Solve data privacy challenges

Synthetically generated data consists of completely new and artificial datapoints that have no one-to-one relationship with the original data. Hence, none of the synthetic datapoints can be traced back or reverse engineered to the original data. As a result, synthetic data is exempt from privacy regulations such as the GDPR and serves as a solution for overcoming data privacy challenges.

Augment and simulate

The generative side of synthetic data generation makes it possible to augment existing data and to simulate completely new data. This helps when you do not have enough data (data scarcity), when you want to up-sample edge cases, or when you do not have any data yet.

Syntho focuses on structured data (data arranged in tables with rows and columns, as you would see in Excel sheets), but we like to illustrate the concept of synthetic data with images, because they are more appealing.

Types of synthetic data

Three types of synthetic data exist under the synthetic data umbrella: dummy data, rule-based generated synthetic data, and synthetic data generated by artificial intelligence (AI). Each of the three is briefly explained below.

Dummy data / mock data

Dummy data is randomly generated data (for example, by a mock data generator).

Consequently, the characteristics, relationships and statistical patterns in the original data are not preserved, captured or reproduced in the generated dummy data. The representativeness of dummy data / mock data is therefore minimal compared with the original data.

  • When to use: to replace direct identifiers (PII), or when you do not have data (yet) and do not want to invest time and energy in defining rules.
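As a minimal sketch of what randomly generated dummy data looks like, the Python snippet below fills each field independently at random. The field names (`name`, `age`, `balance`) and their value ranges are assumptions made up for this illustration, not part of any Syntho product.

```python
import random
import string

def dummy_name(rng, length=8):
    """Return a random capitalised string to stand in for a real name."""
    letters = rng.choices(string.ascii_lowercase, k=length)
    return "".join(letters).capitalize()

def dummy_record(rng):
    """Build one record whose fields are drawn independently at random."""
    return {
        "name": dummy_name(rng),                      # stands in for a direct identifier
        "age": rng.randint(18, 90),                   # uniform, unrelated to any real ages
        "balance": round(rng.uniform(0, 10_000), 2),  # uniform, unrelated to age
    }

rng = random.Random(42)  # fixed seed so the run is repeatable
dummy_rows = [dummy_record(rng) for _ in range(5)]
for row in dummy_rows:
    print(row)
```

Because every field is drawn on its own, any relationship between fields in real data (for example between age and balance) is absent here, which is why the representativeness of dummy data is minimal.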

Rule-based generated synthetic data

Rule-based generated synthetic data is synthetic data generated from a pre-defined set of rules. For example, you might require the synthetic data to have a certain minimum value, maximum value or average value. Any characteristics, relationships and statistical patterns that you want reproduced in the rule-based generated synthetic data must be defined in advance.

Consequently, the data quality can only be as good as the pre-defined set of rules, which creates challenges when high data quality is essential. First, you can define only a limited set of rules to be captured in the synthetic data. Second, setting up multiple rules typically leads to overlapping and conflicting rules. Third, you will never fully cover all relevant rules, and some relevant rules may be ones you are not even aware of. Finally, defining rules costs a lot of time and energy, which makes this an inefficient solution.

  • When to use: when you do not have data (yet).
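The minimum, maximum and average rules mentioned above can be sketched in a few lines of Python. This is one possible approach under stated assumptions: values are drawn from a normal distribution centred on the required average, and draws outside the allowed range are rejected. The function name `generate_rule_based` and the `spread` parameter are hypothetical and chosen only for this example.

```python
import random
import statistics

def generate_rule_based(n, minimum, maximum, average, spread, seed=0):
    """Generate n values that obey simple pre-defined rules.

    Rules: every value lies in [minimum, maximum] and values are centred
    on `average`. `spread` (an assumed extra rule) sets the standard
    deviation of the normal distribution the values are drawn from.
    """
    # Rules can conflict with each other, so check them before generating.
    if not minimum < maximum:
        raise ValueError("conflicting rules: minimum must be below maximum")
    if not minimum <= average <= maximum:
        raise ValueError("conflicting rules: average must lie between minimum and maximum")
    if spread <= 0:
        raise ValueError("spread must be positive")

    rng = random.Random(seed)
    values = []
    while len(values) < n:
        candidate = rng.gauss(average, spread)
        if minimum <= candidate <= maximum:  # reject draws that break the range rule
            values.append(candidate)
    return values

ages = generate_rule_based(n=1000, minimum=0, maximum=100, average=50, spread=15)
print(min(ages), max(ages), statistics.fmean(ages))
```

Note that rejecting out-of-range draws only keeps the average on target when the range is roughly symmetric around it. With an average close to one of the bounds the result drifts, which is a small instance of the overlapping-rules problem described above.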

Synthetic data generated by artificial intelligence (AI)

As the name suggests, synthetic data generated by artificial intelligence (AI) is produced by an AI algorithm. The AI model is trained on the original data to learn its characteristics, relationships and statistical patterns. The algorithm can then generate completely new datapoints, modelled in such a way that they reproduce the characteristics, relationships and statistical patterns of the original dataset. This is what we call a synthetic data twin.

The AI model mimics the original data to generate synthetic data twins that can be used as if they were the original data. This unlocks various use cases in which AI-generated synthetic data serves as an alternative to original (sensitive) data, for example as test data, as demo data or for analytics.

Figure: How synthetic data is generated

Compared with rule-based generated synthetic data, you no longer need to study and define the relevant rules yourself: the AI algorithm does this for you automatically. It covers not only the characteristics, relationships and statistical patterns that you are aware of, but also those that you are not even aware of.

  • When to use: when you have (some) data as input to mimic, or as a starting point for smart data generation and augmentation features.
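The learn-then-generate idea in this section can be illustrated with a deliberately simple stand-in. The sketch below does not use an AI model and is not Syntho's algorithm: it fits a basic linear-Gaussian model to two columns and then samples new rows from it. The toy "original" dataset, the column meanings (age and income) and all function names are assumptions invented for the example.

```python
import math
import random
import statistics

def fit_model(rows):
    """Learn the mean, spread and linear relationship from (x, y) rows."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mean_x = statistics.fmean(xs)
    mean_y = statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx                      # least-squares slope of y on x
    intercept = mean_y - slope * mean_x
    residuals = [y - (intercept + slope * x) for x, y in rows]
    return {
        "mean_x": mean_x,
        "sd_x": statistics.stdev(xs),
        "slope": slope,
        "intercept": intercept,
        "noise_sd": statistics.pstdev(residuals),
    }

def generate(model, n, rng):
    """Sample n new rows from the fitted model; no original row is copied."""
    rows = []
    for _ in range(n):
        x = rng.gauss(model["mean_x"], model["sd_x"])
        noise = rng.gauss(0, model["noise_sd"])
        rows.append((x, model["intercept"] + model["slope"] * x + noise))
    return rows

def correlation(rows):
    """Pearson correlation between the two columns."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Toy "original" data, invented for this example: income rises with age.
rng = random.Random(1)
original = []
for _ in range(500):
    age = rng.uniform(20, 65)
    income = 1000 + 50 * age + rng.gauss(0, 300)
    original.append((age, income))

model = fit_model(original)            # learn the patterns from the original data
synthetic = generate(model, 500, rng)  # produce entirely new datapoints
print(round(correlation(original), 2), round(correlation(synthetic), 2))
```

Unlike the rule-based sketch, nothing about the age-income relationship is specified by hand here: it is estimated from the data and then reproduced in the new rows. A real generator would also need to handle many columns, mixed data types and non-linear relationships, which this two-column sketch does not attempt.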

Which type of synthetic data should you use?

Depending on your use case, a combination of dummy data / mock data, rule-based generated synthetic data and synthetic data generated by artificial intelligence (AI) is advised. This overview gives you a first indication of which type of synthetic data to use. Since Syntho supports all three, feel free to contact our experts to take a deep dive into your use case with us.

Figure: Overview of the different types of synthetic data
