Qhagamshela Ngubani? Imizekelo emi-5 yokuba kutheni ukususa amagama akuyiyo inketho

qikelela ukuba ngubani umdlalo

Isingeniso kuQagela Ngubani

Qhagamshela Ngubani? Nangona ndiqinisekile ukuba uninzi lwenu niyazi lo mdlalo ukusuka ngasemva ngeentsuku, nantsi impinda emfutshane. Injongo yomdlalo: ukufumanisa igama lomlingisi wekhathuni okhethwe ngumchasi wakho ngokubuza imibuzo ngo-'ewe 'no' hayi ', njengokuthi' umntu unxibe umnqwazi? ' okanye 'umntu ufaka iiglasi'? Abadlali bashenxisa abaviwa ngokusekwe kwimpendulo yomchasi kwaye bafunde iimpawu ezinxulumene nemfihlakalo yomchasi wabo. Umdlali wokuqala obala omnye umdlali oyimfihlakalo ophumeleleyo kumdlalo.

Unayo. Umntu kufuneka achonge umntu ngaphandle kwedathasethi ngokufikelela kuphela kwiimpawu ezihambelanayo. Ngapha koko, sihlala siyibona le nto yokuba ngubani ofake isicelo sokuziqhelanisa, kodwa emva koko waqeshwa kwiidatha ezifomathiweyo ezinemiqolo kunye neekholamu ezinempawu zabantu bokwenyani. Umahluko ophambili xa usebenza nedatha kukuba abantu bathambekele ekuthatheni lula ubukho babantu bokwenyani abanokuthi babhengezwe ngokufikelela kuphela kwiimpawu ezimbalwa.

Njengoko ucinga ukuba ngubani umdlalo obonisayo, umntu othile unokuchonga abantu ngokufikelela kuphela kwiimpawu ezimbalwa. Isebenza njengomzekelo olula wokuba kutheni ususa 'amagama' kuphela (okanye ezinye izinto ezichazayo) kwindawo yakho yedatha isilela njengenkqubo yokuchaza amagama. Kule bhlog, sinikezela ngamatyala amane asebenzayo ukukwazisa malunga nomngcipheko wabucala onxulumene nokususwa kweekholamu njengendlela yokuchazwa kwedatha.

2) Uhlaselo lokunxibelelana: idathasethi yakho enxulunyaniswe neminye imithombo yoluntu (yoluntu)

Umngcipheko wokuhlaselwa konxibelelwano sesona sizathu sibalulekileyo sokuba kususwe kuphela amagama kungasebenzi (kwakhona) njengendlela yokuchazwa. Ngohlaselo lonxibelelaniso, umhlaseli udibanisa idatha yoqobo kunye neminye imithombo yedatha efikelelekayo ukuze achonge ngokukodwa umntu kwaye afunde (ulwazi oluhlala lubuthathaka) malunga nalo mntu.

Eyona nto iphambili apha kukufumaneka kwezinye izibonelelo zedatha ezikhoyo ngoku, okanye ezinokubakho kwixesha elizayo. Cinga ngawe. Ingakanani idatha yakho yobuqu enokufunyanwa kuFacebook, kwi-Instagram okanye kwi-LinkedIn enokuthi isetyenziswe gwenxa kuhlaselo lonxibelelwano?

Kwiintsuku ezidlulileyo, ukubakho kwedatha bekuncinci kakhulu, nto leyo ichaza ukuba kutheni ukususwa kwamagama kwakwanele ukugcina imfihlo yabantu. Idatha efumanekayo ithetha ukuba amathuba ambalwa okudibanisa idatha. Nangona kunjalo, ngoku (sisebenza) abathathi-nxaxheba kuqoqosho oluqhutywa yidatha, apho inani ledatha likhula kwinqanaba lokucacisa. Idatha engaphezulu, kunye nokuphucula itekhnoloji yokuqokelela idatha kuya kukhokelela ekwandeni kokuhlaselwa konxibelelwano. Yintoni umntu anokuyibhala kwiminyaka eli-10 ngomngcipheko wokuhlaselwa ngonxibelelwano?

Umzekeliso 1

Idatha ekhulayo ngokucacileyo iyinyani

Inani ledatha

Into yokuphonononga

U-Sweeney (2002) ubonakalisile kwiphepha lezemfundo ukuba ukwazile njani ukubuyisa kunye nokufumana idatha yezonyango evela kubantu abathile ngokusekwe ekunxibelelaniseni iseti yedatha efumanekayo 'yokundwendwela isibhedlele' kumbhalisi wokuvota ofumaneka esidlangalaleni eMelika. Zombini iiseti zedatha apho kuthathwa njengongachazwanga ngokufanelekileyo ngokususwa kwamagama kunye nezinye izinto ezichongiweyo.

Umzekeliso 2

Uhlaselo loqhakamshelwano

Uhlaselo loqhakamshelwano

Ngokusekwe kuphela kwiiparamitha ezintathu (1) iKhowudi yeZip, (2) isini kunye (3) noMhla wokuzalwa, ubonakalisile ukuba i-87% yabantu base-US banokuchongwa kwakhona ngokudibanisa amanqaku achazwe apha ngasentla kuzo zombini iiseti zedatha. U-Sweeney waphinda umsebenzi wakhe wokuba 'nelizwe' njengenye ye 'Zip Code'. Ukongeza, ubonakalisile ukuba i-18% yabantu base-US banokuchongwa kuphela ngokufikelela kwidathasethi enolwazi malunga (1) nelizwe lasekhaya, (2) isini kunye (3) nomhla wokuzalwa. Cinga ngemithombo yoluntu ekhankanywe apha ngasentla, njenge-Facebook, i-LinkedIn okanye i-Instagram. Ngaba ilizwe lakho, isini kunye nomhla wokuzalwa uyabonakala, okanye abanye abasebenzisi bayakwazi ukuyikhupha?

Umzekeliso 3

Iziphumo zikaSweeney

Izazisi zeQuasi

% ichongwe ngokukodwa kubemi base-US (248 yezigidi)

I-ZIP eneenombolo ezi-5, isini, umhla wokuzalwa

87%

indawo, isini, umhla wokuzalwa

53%

lizwe, isini, umhla wokuzalwa

18%

Lo mzekelo ubonakalisa ukuba kunokuba lula ngokulula ukuba ungazichazi igama lomntu kwidatha ebonakala ingaziwa. Okokuqala, olu phononongo lubonisa ubungakanani obukhulu bomngcipheko, njengoko I-87% yabemi base-US inokuchongwa ngokulula kusetyenziswa iimpawu ezimbalwa. Okwesibini, idatha yezonyango ebonakalisiweyo kolu phononongo yayiqwalaselene kakhulu. Imizekelo yeedatha zabantu abatyhilekileyo ezivela kwidasetaset yondwendwela esibhedlele ibandakanya ubuhlanga, isifo kunye nonyango. Iimpawu anokukhetha ukuzigcina ziyimfihlo, umzekelo, kwiinkampani zeinshurensi.

3) Abantu abanolwazi

Omnye umngcipheko wokususa kuphela izinto ezichazayo, ezinje ngamagama, zivela xa abantu abanolwazi benolwazi oluphezulu okanye ulwazi malunga neempawu okanye isimilo sabantu abathile kwidathaset. Ngokusekwe kulwazi lwabo, umhlaseli uya kuba nakho ukudibanisa iirekhodi ezithile zedatha ebantwini.

Into yokuphonononga

Umzekelo wokuhlaselwa kwedathasethi usebenzisa ulwazi oluphezulu yimeko yeeteksi eNew York, apho iAtockar (2014) ikwazile ukubhengeza abantu abathile. Idathasethi eqeshiweyo iqulethe lonke uhambo lweeteksi eNew York, olucebise ngeempawu ezisisiseko ezinje ngokulungelelanisa ukuqala, ulungelelwaniso lokuphela, ixabiso kunye neencam zohambo.

Umntu onolwazi owaziyo ukuba iNew York ikwazile ukufumana uhambo lweteksi ukuya kwiklabhu yabantu abadala 'iHustler'. Ngokucoca 'indawo ekugqityelwa kuyo', wanciphisa iidilesi zokuqala ngqo kwaye ke wachonga iindwendwe ezahlukeneyo rhoqo. Kwangokunjalo, umntu unokuthatha ukukhwela iteksi xa idilesi yasekhaya yomntu eyaziwayo. Ixesha kunye nendawo yeenkwenkwezi ze-movie ezidumileyo zafunyanwa kwiindawo zokuhleba. Emva kokudibanisa olu lwazi nedatha yeeteksi ze-NYC, kwakulula ukufumana abakhweli beeteksi, isixa abasihlawuleleyo, nokuba babethambile na.

Umzekeliso 4

Umntu onolwazi

idrophu inxibelelanisa iHustler

UBradley Cooper

iteksi kunye nemephu

Jessica Alba

ukulandelwa kweemephu

4) Idatha njengeprint zeminwe

Umgca oqhelekileyo wempikiswano kukuba 'le datha ayinamsebenzi' okanye 'akukho mntu unokwenza nantoni na ngale datha'. Oku kuhlala kukungaqondi. Nokuba eyona datha imsulwa inokwenza 'iminwe yeminwe' eyahlukileyo kwaye isetyenziselwe ukuphinda uchonge umntu ngamnye. Ngumngcipheko ovela kwinkolelo yokuba idatha ngokwayo ayinaxabiso, ngelixa ingenjalo.

Umngcipheko wokuchongwa uya kunyuka ngokwanda kwedatha, i-AI, kunye nezinye izixhobo kunye nealgorithms ezenza ukuba kutyhilwe ubudlelwane obuntsonkothileyo kwidatha. Ngenxa yoko, nokuba isiseko sedatha sakho asinakufunyanwa ngoku, kwaye kungenamsebenzi kubantu abangagunyaziswanga namhlanje, isenokungabi ngomso.

Into yokuphonononga

Umzekelo omkhulu yimeko apho iNetflix ijolise ekufumaneni isebe le-R & D ngokwazisa ukhuphiswano oluvulekileyo lweNetflix ukuphucula inkqubo yabo yokucebisa ngemovie. Lowo uphucula ubambiswano lokucoca ulungelelwaniso ukuqikelela ukulinganiswa komsebenzisi kwiifilimu uphumelele ibhaso le-US $ 1,000,000 '. Ukuxhasa isihlwele, iNetflix yapapasha idathasethi equlathe kuphela ezi zinto zilandelayo: umsebenzisiID, imovie, umhla webanga kunye nebakala (ke akukho lwazi longezelelekileyo ngomsebenzisi okanye kwifilimu uqobo).

Umzekeliso 5

Ulwakhiwo lwedatha yeNetflix ixabiso

Isazisi somsebenzisi I-Movie Umhla webakala kwiBanga
123456789 Umsebenzi ongenakwenzeka 10-12-2008 4

Ngaphandle kwedwa, idatha ibonakale ililize. Xa ubuza umbuzo othi 'Ngaba lukhona ulwazi lwabathengi kwi-dathasethi ekufuneka igcinwe iyimfihlo?', Impendulo ibisithi:

 'Hayi, lonke ulwazi lokuchonga abathengi lususiwe; konke okuseleyo kukulinganiswa kunye nemihla. Oku kulandela umgaqo-nkqubo wethu wabucala… '

Nangona kunjalo, uNarayanan (2008) osuka kwiDyunivesithi yaseTexas eAustin wangqina ngenye indlela. Ukudityaniswa kwamabakala, umhla webakala kunye nomdlalo bhanyabhanya womntu ngamnye wenza iifilimu zeminwe ezizodwa. Cinga ngendlela yakho yeNetflix. Ucinga ukuba bangaphi abantu ocinga ukuba babukele iseti yemiboniso bhanyabhanya? Bangaphi ababukele iseti yemiboniso bhanyabhanya ngaxeshanye?

Umbuzo ophambili, ungayitshatisa njani le minwe? Kwakulula. Ngokusekwe kulwazi oluvela kwiwebhusayithi eyaziwa njenge-IMDb yokulinganisela imovie (indawo yogcino bhanyabhanya ekwi-Intanethi), kuya kwenziwa iminwe efanayo. Ngenxa yoko, abantu banokuchongwa kwakhona.

Ngelixa isimilo sokubukela imovie singenakucingelwa njengolwazi olubuthathaka, cinga ngendlela oziphethe ngayo- ungakhathazeka xa esiya esidlangalaleni? Imizekelo ebonelelwe nguNarayanan ephepheni kukhetho lwezopolitiko (ukulinganiswa 'kukaYesu waseNazarete' kunye 'neVangeli kaYohane') kunye nokukhethwa kwezesondo (ukulinganisa 'iBent' kunye 'neQueer njengabantu') ezinokuchithwa ngokulula.

5) Ummiselo woKhuseleko lweDatha ngokuBanzi (GDPR)

I-GDPR isenokungonwabisi kwaphela, okanye imbumbulu yesilivere phakathi kwezihloko zebhlog. Ewe kunjalo, kuyanceda ukufumana iinkcazo ngokuthe ngqo xa kusetyenzwa ngedatha yobuqu. Kuba le bhlog imalunga nokungaqondi okuqinisekileyo kokususa iikholamu njengendlela yokuchonga idatha kunye nokukufundisa njengeprosesa yedatha, masiqale ngokujonga inkcazo yokungachazwa ngokwe-GDPR. 

Ngokwe-recital 26 evela kwi-GDPR, ulwazi olungachazwanga luchazwa njenge:

'Ulwazi olungahambelani nomntu ochongiweyo okanye ochongiweyo wendalo okanye idatha yobuqu engenziwanga ngegama ngendlela yokuba idatha yedatha ayisaziwa okanye ayisabonakali.'

Kuba enye yenkqubo yedatha yobuqu enxulumene nomntu wendalo, yinxalenye yesi-2 yenkcazo efanelekileyo kuphela. Ukuze uhambelane nenkcazo, umntu kufuneka aqinisekise ukuba isifundo sedatha (umntu) asiyiyo okanye ayisabonakali. Njengoko kubonisiwe kule bhlog, nangona kunjalo, kulula kakhulu ngokuchonga abantu ngokusekwe kwiimpawu ezimbalwa. Ke, ukususa amagama kwidathas akuhambelani nenkcazo ye-GDPR yokungaziwa.

Ukuququmbela

Sicel'umngeni kwinto eqhelekileyo ethathelwa ingqalelo kwaye, ngelishwa, indlela esetyenziswa rhoqo yokuchazwa kwedatha: ukususa amagama. Kwingqikelelo Ngubani umdlalo kunye neminye imizekelo emine malunga:

  • Ukuhlaselwa kwamakhonkco
  • Abantu abanolwazi
  • Idatha njengeprint zeminwe
  • Umgaqo-nkqubo woKhuseleko loLwazi jikelele (GDPR)

kwaboniswa ukuba ukususa amagama kusilele njengokungaziwa. Nangona imizekelo yamatyala eqhankqalazayo, nganye ibonisa ukulula kokuchongwa kwakhona kunye nefuthe elibi elinokubakho kubucala babantu.

Ukuqukumbela, ukususwa kwamagama kwindawo yakho yedatha akubangeli idatha engaziwayo. Ngenxa yoko, kungcono siphephe ukusebenzisa omabini la magama ngokutshintshana. Ndiyathemba ukuba ngekhe uyisebenzise le ndlela yokwenza ukuba ungachazwa. Kwaye, ukuba usayenza, qinisekisa ukuba wena neqela lakho niyayiqonda ngokupheleleyo imingcipheko yabucala, kwaye nivunyelwe ukwamkela loo mngcipheko egameni labantu abachaphazelekayo.

iqela labantu abancumileyo

Idatha yenziwe, kodwa iqela lethu liyinyani!

Nxibelelana noSyntho kwaye enye yeengcali zethu iya kunxibelelana nawe ngesantya sokukhanya ukuphonononga ixabiso ledatha eyenziweyo!

  • D. Reinsel, J. Gantz, uJohn Rydning. Ukudityaniswa kwehlabathi ukusuka kwi-Edge ukuya kwi-Core, kwiDatha yobudala 2025, 2018
  • L. Sweeney. k-ukungaziwa: imodeli yokukhusela imfihlo. Ijenali yaMazwe ngaMazwe yokungaqiniseki, iiFuzziness kunye neeNkqubo eziSekwe kuLwazi, 10 (5), 2002: 557-570
  • L. Sweeney. Ubalo lwabantu olulula luhlala luchonga abantu ngokuKhethekileyo. IYunivesithi yaseCarnegie Mellon, iPhepha eliSebenzayo lokuSebenza ngePhepha 3. IPittsburgh 2000
  • P. Samarati. Ukukhusela iZazisi zaBaphenduli kwiMicrodata Release. Intengiselwano ye-IEEE kuLwazi nakwiNjineli yeDatha, 13 (6), 2001: 1010-1027
  • I-Atockar. Ukuhamba ngeenkwenkwezi: Imfihlo yabakhweli kwi-NYC Taxicab Dataset, ngo-2014
  • UNarayanan, A., kunye noShmatikov, V. (2008). Ukusetyenziswa ngokungachazwanga kweedasethi ezinkulu ezinqabileyo. Kwinkqubo-2008 IEEE Symposium yoKhuseleko kunye nokuBucala, SP (iphe. 111-125)
  • Ummiselo woKhuseleko lweDatha ngokuBanzi (i-GDPR), i-Recital 26, ayisebenzi kwiDatha engaziwayo