{"id":5816,"date":"2025-01-14T07:00:00","date_gmt":"2025-01-14T06:00:00","guid":{"rendered":"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/"},"modified":"2025-01-14T07:00:00","modified_gmt":"2025-01-14T06:00:00","slug":"kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/hr\/kako-nedostatak-podataka-ugrozava-buducnost-umjetne-inteligencije\/","title":{"rendered":"Kako nedostatak podataka ugro\u017eava budu\u0107nost umjetne inteligencije"},"content":{"rendered":"<p class=\"wp-block-paragraph\">Svijet umjetne inteligencije suo\u010dava se s nedostatkom svoje najvrjednije sirovine \u2013 podataka. To je potaknulo rasprave o sve popularnijoj alternativi: sinteti\u010dkim ili \u010dak \u201ela\u017enim\u201c podacima. Godinama tvrtke poput OpenAI-a i Googlea rudare podatke s interneta kako bi obu\u010davale modele velikih jezika (LLM) koji pokre\u0107u njihova AI rje\u0161enja. Ti su modeli probavili ogromne koli\u010dine sadr\u017eaja koji su generirali ljudi, od istra\u017eiva\u010dkih radova i romana do YouTube videa.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Sada tih podataka polako ponestaje, njihova koli\u010dina postaje sve ograni\u010denija. Odre\u0111eni glavni igra\u010di na tom polju, poput direktora OpenAI-ja Sama Altmana, vjeruju da \u0107e samou\u010de\u0107i modeli mo\u0107i koristiti sinteti\u010dke podatke, \u0161to bi predstavljalo jeftin i gotovo beskona\u010dan izvor podataka.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Me\u0111utim, istra\u017eiva\u010di upozoravaju na rizike. Sinteti\u010dki podaci mogli bi smanjiti kvalitetu modela, jer mogu biti \u201eotrovani\u201c vlastitim pogre\u0161kama. <a href=\"https:\/\/arxiv.org\/pdf\/2305.17493\" target=\"_blank\" rel=\"noreferrer noopener\">Istra\u017eivanja sveu\u010dili\u0161ta Oxford i Cambridge<\/a> pokazali su da isklju\u010divo hranjenje modela sinteti\u010dkim podacima dovodi do lo\u0161ih rezultata i \u201ebesmislenosti\u201c. Po njihovom mi\u0161ljenju, klju\u010dna je uravnote\u017eena upotreba sinteti\u010dkih i stvarnih podataka.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Sve vi\u0161e tvrtki stvara sinteti\u010dke podatke<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Nedostatak podataka navodi tvrtke da tra\u017ee alternative, poput sinteti\u010dkih podataka koje generiraju UI sustavi na temelju stvarnih podataka. Tehnolo\u0161ke tvrtke, uklju\u010duju\u0107i OpenAI i Google, ve\u0107 pla\u0107aju milijune za pristup podacima s platformi poput Reddita i raznih medijskih ku\u0107a, jer web stranice sve vi\u0161e ograni\u010davaju besplatnu upotrebu svojih sadr\u017eaja. Me\u0111utim, resursi su ograni\u010deni.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Nvidia, Tencent i startupi Gretel i SynthLabs razvijaju alate za stvaranje sinteti\u010dkih podataka koji su \u010desto \u010di\u0161\u0107i i specifi\u010dniji od podataka koje generiraju ljudi. S Llama 3.1, Meta je koristila sinteti\u010dke podatke za pobolj\u0161anje vje\u0161tina kao \u0161to su programiranje i rje\u0161avanje matemati\u010dkih problema. Sinteti\u010dki podaci tako\u0111er nude mogu\u0107nost smanjenja pristranosti svojstvene stvarnim podacima, iako istra\u017eiva\u010di upozoravaju da osiguravanje to\u010dnosti i nepristranosti ostaje veliki izazov.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">&quot;Habsbur\u0161ka&quot; umjetna inteligencija<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Iako sinteti\u010dki podaci donose prednosti, oni tako\u0111er predstavljaju ozbiljne rizike. <a href=\"https:\/\/scontent-lhr8-1.xx.fbcdn.net\/v\/t39.2365-6\/453304228_1160109801904614_7143520450792086005_n.pdf?_nc_cat=108&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=PC3CtquZIecQ7kNvgEd56UN&amp;_nc_ht=scontent-lhr8-1.xx&amp;oh=00_AYCZLndlqJrzHln7YJPZgA20dTYBRdoZWwrQxxzEPpDRPQ&amp;oe=66B815C7\" target=\"_blank\" rel=\"noreferrer noopener\">Meta istra\u017eivanje o modelu Llama 3.1 <\/a>pokazalo je da treniranje modela na vlastitim sinteti\u010dkim podacima zapravo mo\u017ee smanjiti njegove performanse. Sli\u010dno tome, <a href=\"https:\/\/affiliate.insider.com\/?h=be468ee8ce72a81899f985f5f8550abdd05c3e150925e5b87d82d02e5a0c7498&amp;platform=browser&amp;postID=66acfb7ee3c9582388c4ffd0&amp;postSlug=ai-synthetic-data-industry-debate-over-fake-2024-8&amp;site=bi&amp;u=https%3A%2F%2Fwww.nature.com%2Farticles%2Fs41586-024-07566-y\" target=\"_blank\" rel=\"noreferrer noopener\">studija u \u010dasopisu Nature<\/a> upozorio je da nekontrolirana upotreba sinteti\u010dkih podataka dovodi do \u201ekolapsa modela\u201c, \u0161to su istra\u017eiva\u010di usporedili s genetskom degeneracijom i simboli\u010dno nazvali fenomen \u201ehabsbur\u0161ka umjetna inteligencija\u201c. Termin koji je skovao istra\u017eiva\u010d Jathan Sadowski.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Glavno pitanje ostaje: koliko je sinteti\u010dkih podataka previ\u0161e? Neki stru\u010dnjaci predla\u017eu kori\u0161tenje hibridnih podataka, gdje se sinteti\u010dki podaci kombiniraju sa stvarnim podacima kako bi se sprije\u010dila degradacija modela. Tvrtke poput Scale AI istra\u017euju ovaj pristup, a njihov izvr\u0161ni direktor Alexandr Wang vjeruje da je hibridni pristup \u201eprava budu\u0107nost\u201c.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Pronala\u017eenje novih rje\u0161enja<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">U sije\u010dnju je Google DeepMind predstavio AlphaGeometry, sustav koji rje\u0161ava geometrijske probleme na izuzetno visokoj razini koriste\u0107i \u201eneuro-simboli\u010dki\u201c pristup. Kombinira prednosti dubokog u\u010denja s intenzivnom obradom podataka i zaklju\u010divanja temeljenog na pravilima. Model je u potpunosti obu\u010den na sinteti\u010dkim podacima i smatra se potencijalnim korakom prema op\u0107oj umjetnoj inteligenciji.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Neuro-simboli\u010dko podru\u010dje je jo\u0161 uvijek mlado, ali moglo bi ponuditi obe\u0107avaju\u0107i smjer za budu\u0107nost razvoja umjetne inteligencije. Pod pritiskom monetizacije, tvrtke poput OpenAI-ja, Googlea i Microsofta isprobat \u0107e sva mogu\u0107a rje\u0161enja za prevladavanje podatkovne krize. <\/p>","protected":false},"excerpt":{"rendered":"<p>Svet umetne inteligence se soo\u010da s pomanjkanjem svoje najdragocenej\u0161e surovine \u2013 podatkov. To je spro\u017eilo razprave o vse bolj priljubljeni alternativi: sinteti\u010dnih ali celo &#8220;la\u017enih&#8221; podatkih. Dolga leta so podjetja, kot sta OpenAI in Google, za u\u010denje velikih jezikovnih modelov (LLM-jev), ki poganjajo njihove UI re\u0161itve, pridobivala podatke z interneta. Ti modeli so prebavili ogromne [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66,4],"tags":[192],"class_list":["post-5816","post","type-post","status-publish","format-standard","hentry","category-programi","category-racunalnistvo-telefonija","tag-umetna-inteligenca"],"acf":{"subtitle":"Umetna inteligenca se soo\u010da s pomanjkanjem klju\u010dnih podatkov, kar vodi k uporabi sinteti\u010dnih re\u0161itev. So \"la\u017eni\" podatki lahko prihodnost umetne inteligence ali tveganje za kakovost modelov in njihove zmogljivosti?","heading":"","summary":"Umetna inteligenca se soo\u010da s pomanjkanjem klju\u010dnih podatkov, kar vodi k uporabi sinteti\u010dnih re\u0161itev. So \"la\u017eni\" podatki lahko prihodnost umetne inteligence ali tveganje za kakovost modelov in njihove zmogljivosti?","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2025\/01\/ali-shah-lakhani-sp1BZ1atp7M-unsplash-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2025\/01\/ali-shah-lakhani-sp1BZ1atp7M-unsplash-768x1024.jpg","thumbnail_caption":"","gallery":"","video_gallery":null,"author":"","links":null,"sources":[{"title":"Business Insider","url":"https:\/\/www.businessinsider.com\/ai-synthetic-data-industry-debate-over-fake-2024-8?utm_source=chatgpt.com"},{"title":"Unsplash","url":"https:\/\/unsplash.com\/photos\/assorted-source-codes-sp1BZ1atp7M"}],"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"Umetna inteligenca se soo\u010da s pomanjkanjem klju\u010dnih podatkov, kar vodi k uporabi sinteti\u010dnih re\u0161itev. So &quot;la\u017eni&quot; podatki lahko prihodnost umetne inteligence ali tveganje za kakovost modelov in njihove\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/5816\" \/>\n<meta property=\"og:locale\" content=\"hr_HR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Svet umetne inteligence se soo\u010da s pomanjkanjem svoje najdragocenej\u0161e surovine \u2013 podatkov. To je spro\u017eilo razprave o vse bolj priljubljeni alternativi: sinteti\u010dnih ali celo &#8220;la\u017enih&#8221; podatkih. Dolga leta so podjetja, kot sta OpenAI in Google, za u\u010denje velikih jezikovnih modelov (LLM-jev), ki poganjajo njihove UI re\u0161itve, pridobivala podatke z interneta. Ti modeli so prebavili ogromne [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/hr\/kako-nedostatak-podataka-ugrozava-buducnost-umjetne-inteligencije\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2025-01-14T06:00:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Napisao\/la\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Procijenjeno vrijeme \u010ditanja\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/\",\"name\":\"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2025-01-14T06:00:00+00:00\",\"dateModified\":\"2025-01-14T06:00:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/#breadcrumb\"},\"inLanguage\":\"hr\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"hr\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/hr\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice","description":"Umetna inteligenca se soo\u010da s pomanjkanjem klju\u010dnih podatkov, kar vodi k uporabi sinteti\u010dnih re\u0161itev. So \"la\u017eni\" podatki lahko prihodnost umetne inteligence ali tveganje za kakovost modelov in njihove","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/5816","og_locale":"hr_HR","og_type":"article","og_title":"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice","og_description":"Svet umetne inteligence se soo\u010da s pomanjkanjem svoje najdragocenej\u0161e surovine \u2013 podatkov. To je spro\u017eilo razprave o vse bolj priljubljeni alternativi: sinteti\u010dnih ali celo &#8220;la\u017enih&#8221; podatkih. Dolga leta so podjetja, kot sta OpenAI in Google, za u\u010denje velikih jezikovnih modelov (LLM-jev), ki poganjajo njihove UI re\u0161itve, pridobivala podatke z interneta. Ti modeli so prebavili ogromne [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/hr\/kako-nedostatak-podataka-ugrozava-buducnost-umjetne-inteligencije\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2025-01-14T06:00:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"Napisao\/la":"sinusiks","Procijenjeno vrijeme \u010ditanja":"3 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/","url":"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/","name":"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence - Ra\u010dunalni\u0161ke novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2025-01-14T06:00:00+00:00","dateModified":"2025-01-14T06:00:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/#breadcrumb"},"inLanguage":"hr","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/kako-pomanjkanje-podatkov-ogroza-prihodnost-umetne-inteligence\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"Kako pomanjkanje podatkov ogro\u017ea prihodnost umetne inteligence"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"hr"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/hr\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/5816","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/comments?post=5816"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/5816\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/media?parent=5816"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/categories?post=5816"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/tags?post=5816"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}