Kace wane? Misalai 5 da yasa cire sunaye ba zaɓi bane

tunanin wane game

Gabatarwa ga Tsammani Wanene

Tsammani Wanene? Kodayake na tabbata cewa yawancin ku sun san wannan wasan daga baya a cikin kwanakin, anan ga takaitaccen bayani. Makasudin wasan: gano sunan halin zane mai ban dariya wanda abokin adawar ku ya zaɓa ta hanyar yin tambayoyin 'yes' da 'a'a', kamar 'mutumin yana sanya hula?' ko 'mutum yana sanya tabarau'? 'Yan wasan suna kawar da' yan takara dangane da martanin abokin hamayya kuma suna koyan halayen da ke da alaƙa da halayyar asirin abokin hamayyarsu. Dan wasan da ya fara lissafa halin sirrin ɗan wasan ya lashe wasan.

Kun samu. Dole ne mutum ya tantance mutum daga cikin dataset ɗin ta hanyar samun dama ga halayen da suka dace. A zahiri, koyaushe muna ganin wannan ra'ayi na Tsammani Wanda ya yi aiki a aikace, amma sai aka yi amfani da shi a kan bayanan da aka tsara tare da layuka da ginshiƙai masu ɗauke da halayen mutane na ainihi. Babban banbanci lokacin aiki tare da bayanai shine mutane suna ƙin yin la'akari da sauƙin da za a iya ɓoye ainihin mutane ta hanyar samun dama ga wasu sifofi kaɗan.

Kamar yadda wasan Guess Who ya nuna, wani zai iya tantance mutane ta hanyar samun dama ga wasu sifofi kaɗan. Yana zama misali mai sauƙi na me yasa cire 'sunayen' kawai (ko wasu masu gano kai tsaye) daga bayanan bayananku ya gaza azaman dabarun ɓoyewa. A cikin wannan rukunin yanar gizon, muna ba da lamuran aiki guda huɗu don sanar da ku game da haɗarin sirrin da ke tattare da cire ginshiƙai a matsayin hanyar ɓoyayyiyar bayanai.

2) Hare -hare na Haɗin kai: bayanan bayanan ku yana da alaƙa da wasu hanyoyin (na jama'a)

Haɗarin kai hare -haren haɗin gwiwa shine mafi mahimmancin dalilin da yasa kawai cire sunaye baya aiki (babu kuma) azaman hanyar ɓoye sunaye. Tare da harin haɗin gwiwa, maharin ya haɗa bayanan asali tare da sauran hanyoyin samun bayanai don keɓance mutum da kuma koyan (sau da yawa mahimmanci) bayanai game da wannan mutumin.

Maɓalli anan shine samuwar wasu albarkatun bayanai waɗanda suke yanzu, ko kuma suna iya kasancewa a nan gaba. Ka yi tunanin kanka. Nawa keɓaɓɓun bayanan keɓaɓɓun ku za a iya samu akan Facebook, Instagram ko LinkedIn waɗanda za a iya cin zarafin su don kai hari?

A cikin kwanakin da suka gabata, kasancewar bayanai sun fi iyakancewa, wanda a wani ɓangare yana bayanin dalilin cire sunayen ya isa don adana sirrin mutane. Ƙananan bayanan da ake samu yana nufin ƙarancin damar don haɗa bayanai. Koyaya, yanzu mu masu aiki ne (masu aiki) a cikin tattalin arzikin da ake sarrafa bayanai, inda adadin bayanai ke ƙaruwa cikin ƙima. Ƙarin bayanai, da haɓaka fasaha don tattara bayanai za su haifar da haɓaka yiwuwar kai hare -hare. Menene mutum zai rubuta a cikin shekaru 10 game da haɗarin harin haɗin gwiwa?

Misalin 1

Ƙaruwar bayanai da ke ƙaruwa gaskiya ce

Cikakkun bayanai

Nazarin hali

Sweeney (2002) ta nuna a cikin takardar ilimi yadda ta sami damar ganowa da kuma dawo da bayanan likitanci masu mahimmanci daga mutane dangane da haɗa bayanan jama'a na 'ziyartar asibiti' zuwa mai rejista na zaɓe a bainar jama'a a Amurka. Duk bayanan bayanan inda aka ɗauka cewa ba a bayyana su da kyau ta hanyar share sunaye da sauran masu gano kai tsaye.

Misalin 2

Haɗin kai a aikace

Haɗin Haɗin kai

Dangane da sigogi uku kawai (1) Lambar Zip, (2) Jinsi da (3) Ranar Haihuwa, ta nuna cewa 87% na duk yawan jama'ar Amurka za a iya sake gano su ta hanyar dacewa da abubuwan da aka ambata daga bayanan bayanan biyu. Daga nan Sweeney ta maimaita aikinta tare da samun 'ƙasa' a madadin 'Zip Code'. Bugu da ƙari, ta nuna cewa kashi 18% na yawan jama'ar Amurka za a iya gano su kawai ta hanyar samun damar yin amfani da bayanan da ke ɗauke da bayanai game da (1) ƙasar gida, (2) jinsi da (3) ranar haihuwa. Ka yi tunani game da kafofin jama'a da aka ambata, kamar Facebook, LinkedIn ko Instagram. Ana iya ganin ƙasarka, jinsi da ranar haihuwa, ko wasu masu amfani suna iya cire ta?

Misalin 3

Sakamakon Sweeney

Masu bincike

An gano musamman na yawan jama'ar Amurka (miliyan 248)

ZIP mai lamba 5, jinsi, ranar haihuwa

87%

wuri, jinsi, ranar haihuwa

53%

kasar, jinsi, ranar haihuwa

18%

Wannan misalin yana nuna cewa yana iya zama mai sauƙin sauƙi don ɓoye sunan mutane a cikin bayanan da ba a sani ba. Na farko, wannan binciken yana nuna babbar haɗarin, kamar 87% na yawan jama'ar Amurka ana iya gano su cikin sauƙin amfani 'yan halaye. Na biyu, bayanan likita da aka fallasa a cikin wannan binciken sun kasance masu matukar damuwa. Misalan bayanan mutanen da aka fallasa daga bayanan ziyartar asibiti sun haɗa da ƙabilanci, ganewar asali da magani. Halayen da mutum zai iya ɓoye sirri, misali, daga kamfanonin inshora.

3) Mutanen da aka sani

Wani haɗarin cire masu ganowa kai tsaye kawai, kamar sunaye, yana tasowa lokacin da mutanen da aka sanar suna da babban ilimi ko bayani game da halaye ko halayen takamaiman mutane a cikin bayanan bayanan.. Dangane da ilimin su, maharin na iya iya danganta takamaiman bayanan bayanai ga ainihin mutane.

Nazarin hali

Misalin farmaki kan dataset ta amfani da ingantaccen ilimi shine shari'ar taksi na New York, inda Atockar (2014) ya sami damar buɗe takamaiman mutane. Bayanan bayanan da ke aiki ya ƙunshi duk tafiye -tafiyen taksi a cikin New York, wadata tare da sifofi na asali kamar daidaiton farawa, daidaitawa na ƙarshe, farashi da ƙarshen hawan.

Mutumin da ya sani wanda ya san New York ya sami damar samun tafiye -tafiyen taksi zuwa kulob mai girma 'Hustler'. Ta hanyar tace 'ƙarshen wurin', ya cire ainihin adireshin farawa kuma ta haka ne ya gano baƙi daban -daban. Hakanan, mutum na iya cire taksi yayin da aka san adireshin gidan mutumin. An gano lokaci da wurin wasu fitattun taurarin fina -finai a shafukan tsegumi. Bayan haɗa wannan bayanin zuwa bayanan taksi na NYC, yana da sauƙi a sami hawan taksi ɗin su, adadin da suka biya, da kuma ko sun ba da shawara.

Misalin 4

Mutum mai sani

drop-off yana daidaita Hustler

Bradley Cooper

taxi da taswira

Jessica Alba

bin taswira

4) Bayanai a matsayin zanen yatsa

Hanya ɗaya ta muhawara ita ce 'wannan bayanan ba shi da amfani' ko 'babu wanda zai iya yin komai da wannan bayanan'. Wannan sau da yawa kuskure ne. Ko da mafi yawan bayanan marasa laifi na iya ƙirƙirar 'sawun yatsa' na musamman kuma ana iya amfani da su don sake tantance mutane. Hadari ne da aka samo daga imani cewa bayanan da kansu ba su da amfani, alhali ba haka bane.

Hadarin ganewa zai ƙaru tare da haɓaka bayanai, AI, da sauran kayan aiki da algorithms waɗanda ke ba da damar fallasa hadaddun alaƙa a cikin bayanai. Sakamakon haka, ko da ba za a iya fallasa bayanan bayanan ku a yanzu ba, kuma mai yiwuwa ba shi da amfani ga mutanen da ba su da izini a yau, maiyuwa ba gobe ba ne.

Nazarin hali

Babban misali shine yanayin da Netflix ya yi niyyar tara jama'a sashin R&D ta hanyar gabatar da gasar Netflix buɗe don haɓaka tsarin shawarwarin fim ɗin su. 'Wanda ke haɓaka algorithm na haɗin gwiwa don yin hasashen ƙimar masu amfani don fina -finai ya sami kyautar $ 1,000,000'. Don tallafa wa taron, Netflix ya buga bayanan bayanai da ke ɗauke da waɗannan mahimman halaye masu zuwa kawai: mai amfani, fim, kwanan wata da daraja (don haka babu ƙarin bayani kan mai amfani ko fim ɗin kansa).

Misalin 5

Tsarin tsarin bayanai Netflix farashin

Mai amfani movie Kwanan darasi Grade
123456789 Jakadancin ba zai yiwu ba 10-12-2008 4

A ware, bayanan sun bayyana banza. Lokacin yin tambayar 'Shin akwai wani bayanin abokin ciniki a cikin bayanan da yakamata a kiyaye shi?', Amsar ita ce:

 'A'a, an cire duk bayanan da ke tantance abokin ciniki; duk abin da ya rage shine kimantawa da kwanan wata. Wannan ya biyo bayan manufar sirrinmu… ''

Koyaya, Narayanan (2008) daga Jami'ar Texas a Austin ya tabbatar da haka. Haɗin maki, ranar sa da fim ɗin mutum yana samar da yatsa na musamman na fim. Ka yi tunani game da halayen Netflix naka. Mutane nawa kuke ganin sun kalli shirin fina -finai iri ɗaya? Mutane nawa ne suka kalli shirin fina -finai iri ɗaya a lokaci guda?

Babban tambaya, yadda za a daidaita wannan zanen yatsan? Ya kasance mai sauƙi. Dangane da bayanai daga sanannen gidan yanar gizo mai kimanta fim ɗin IMDb (Database na Intanet na Intanet), ana iya ƙirƙirar irin wannan zanen yatsa. Sakamakon haka, ana iya sake tantance mutane.

Duk da yake ba za a ɗauka ɗabi'ar kallon fim ɗin a matsayin muhimmin bayani ba, yi tunani game da ɗabi'ar ku-za ku damu idan ta fito fili? Misalan da Narayanan ya bayar a cikin takardarsa zaɓin siyasa ne (kimantawa akan 'Yesu Nazaret' da 'Linjilar Yahaya') da abubuwan son jima'i (ƙima akan 'Bent' da 'Queer as folk') waɗanda za a iya narkar da su cikin sauƙi.

5) Dokar Kariyar Bayanai (GDPR)

GDPR na iya zama mai ban sha'awa, ko harsashin azurfa tsakanin batutuwan blog. Duk da haka, yana da taimako don samun ma'anar kai tsaye lokacin sarrafa bayanan sirri. Tunda wannan shafin yanar gizon yana game da kuskuren yau da kullun na cire ginshiƙai azaman hanyar ɓoye bayanai da ilmantar da ku azaman mai sarrafa bayanai, bari mu fara tare da binciko ma'anar anonomization a cewar GDPR. 

Dangane da karatun 26 daga GDPR, an bayyana bayanan da ba a sani ba a matsayin:

'bayanan da ba su da alaƙa da wani mutum da aka sani ko mai ganewa ko bayanan sirri da aka sanya ba a sani ba ta hanyar da ba za a iya gane batun batun ba ko kuma a daina ganewa.'

Tunda mutum yana aiwatar da bayanan sirri wanda ke da alaƙa da mutum na halitta, sashi na 2 kawai na ma'anar ya dace. Domin bin ƙa'idar, dole ne mutum ya tabbatar cewa batun bayanan (mutum) ba a iya ganewa ko kuma a daina gane su. Kamar yadda aka nuna a cikin wannan rukunin yanar gizon, duk da haka, yana da sauƙin sauƙi don gano mutane dangane da wasu sifofi. Don haka, cire sunaye daga bayanan bayanai bai dace da ma'anar GDPR na anonmization ba.

a ƙarshe

Mun ƙalubalanci wanda aka yi la'akari da shi, da rashin alheri, har yanzu ana amfani da tsarin amfani da bayanan sirri: cire sunaye. A cikin Tsammani Wanene wasan da wasu misalai huɗu game da:

  • Haɗin kai
  • Mutane masu sani
  • Bayanai a matsayin zanen yatsa
  • Dokar Kare Kariyar Kari (GDPR)

an nuna cewa cire sunaye ya gaza a matsayin ɓoye suna. Kodayake misalai lamura ne masu ban mamaki, kowannensu yana nuna sauƙin sake ganewa da yuwuwar mummunan tasiri kan sirrin daidaikun mutane.

A ƙarshe, cire sunaye daga bayanan bayananku ba ya haifar da bayanan da ba a sani ba. Don haka, ya fi kyau mu guji amfani da waɗannan sharuɗɗan sau biyu. Ina fatan da gaske ba za ku yi amfani da wannan hanyar don ɓoye suna ba. Kuma, idan har yanzu kuna yin hakan, tabbatar cewa ku da ƙungiyar ku sun fahimci haɗarin sirrin, kuma an ba ku izinin karɓar waɗannan haɗarin a madadin mutanen da abin ya shafa.

kungiyar mutane suna murmushi

Bayanai na roba ne, amma ƙungiyarmu ta gaske ce!

Tuntuɓi Syntho kuma daya daga cikin masananmu zai tuntube ku a cikin saurin haske don bincika darajar bayanan roba!

  • D. Reinsel, J. Gantz, John Rydning. Digitization na Duniya Daga Edge zuwa Core, Age Age 2025, 2018
  • L. Sweeney. k-anonymity: samfuri don kare sirri. Jaridar Kasa da Kasa kan Rashin Tabbatacciya, Hauka da Tsarin tushen Ilimi, 10 (5), 2002: 557-570
  • L. Sweeney. Sauƙaƙan alƙalumai Sau da yawa Suna Gane Mutane Musamman. Jami'ar Carnegie Mellon, Takardar Aikin Sirrin Bayanai 3. Pittsburgh 2000
  • P. Samarati. Kare Ilimin Masu Amsawa a Sakin Microdata. Ayyukan IEEE akan Ilimi da Injiniyan Bayanai, 13 (6), 2001: 1010-1027
  • Atockar. Hawa tare da Taurari: Sirrin Fasinja a cikin NYC Taxicab Dataset, 2014
  • Narayanan, A., & Shmatikov, V. (2008). Ƙarfafawa ta ɓoye bayanan manyan bayanan bayanai. A Ci gaba-Taron IEEE na 2008 akan Tsaro da Sirri, SP (shafi na 111-125)
  • Dokar Kariyar Bayanai (GDPR), Labari na 26, Ba a Aiwatar da Bayanan Ba ​​da Bayani