{"id":7527,"date":"2026-04-29T06:05:00","date_gmt":"2026-04-29T04:05:00","guid":{"rendered":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"},"modified":"2026-04-29T06:05:00","modified_gmt":"2026-04-29T04:05:00","slug":"google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/hr\/google-deepmind-predstavlja-generalisticki-model-koji-pomice-granice-racunalnog-vida\/","title":{"rendered":"Google DeepMind presents a generalist model that pushes the boundaries of computer vision"},"content":{"rendered":"<p>With its Vision Banana model, a Google DeepMind research team has shown that pretraining on image generation provides a strong foundation for general understanding of the visual world, much as large language models (LLMs) develop an understanding of language by predicting the next word. The system is based on Nano Banana Pro, Google's most advanced image generator, which was turned into Vision Banana through lightweight instruction tuning. The key innovation is that a range of computer vision tasks, such as segmentation, depth estimation and surface normal estimation, are recast as RGB image generation tasks.<br><br>Vision Banana achieved state-of-the-art results in so-called \u201czero-shot\u201d settings, where the model has had no prior exposure to the specific datasets. It outperformed SAM 3 in image segmentation and reached a metric depth score of 0.929 (\u03b41), beating the previous record holder Depth Anything V3 (0.918). 
Particularly impressive is that the model needs no camera parameters to estimate depth, which until now has been a major obstacle for such systems.<br><br>This approach offers three key advantages. First, a single neural network can perform a wide range of tasks, with only the text prompt changing. Second, adapting the model required only a small amount of task-specific visual data. Third, despite its new analytical capabilities, Vision Banana fully retains its original ability to generate high-quality photorealistic images.<br><br>The researchers believe we are witnessing a paradigm shift in which generative pretraining will become the standard for building the general-purpose vision models of the future. Vision Banana is not just a new tool but evidence that the ability to create visual content implicitly requires a deep understanding of geometry, semantics and spatial relationships in the real world.<\/p>\n<div class=\"embed-container\"><iframe src=\"https:\/\/www.youtube.com\/embed\/I8VUN141MjU\" frameborder=\"0\" allowfullscreen><\/iframe><\/div><br\/>","protected":false},"excerpt":{"rendered":"<p>With its Vision Banana model, a Google DeepMind research team has shown that pretraining on image generation provides a strong foundation for general understanding of the visual world, much as large language models (LLMs) develop an understanding of language by predicting the next word. 
The system is based on Nano Banana Pro, Google's most advanced image generator, which, by means of lightweight tuning on [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66],"tags":[126],"class_list":["post-7527","post","type-post","status-publish","format-standard","hentry","category-programi","tag-google"],"acf":{"subtitle":"Google DeepMind has unveiled Vision Banana, a revolutionary image generation model that uses instruction tuning to achieve exceptional results in understanding visual data. In tests, the model beat specialized systems such as SAM 3 in image segmentation and Depth Anything V3 in metric depth estimation, pointing to a major shift in the development of artificial intelligence.","heading":"","summary":"Google DeepMind has introduced Vision Banana, a model that solves complex vision tasks by generating images. 
It outperformed the specialized tools SAM 3 and Depth Anything V3, demonstrating the power of generative vision.","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-1024x768.jpg","thumbnail_caption":"Photo: Google","gallery":"","video_gallery":[{"youtube_url":"https:\/\/www.youtube.com\/watch?v=I8VUN141MjU"}],"author":"","links":[{"title":"Google ","url":""}],"sources":null,"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/7527\" \/>\n<meta property=\"og:locale\" content=\"hr_HR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/hr\/google-deepmind-predstavlja-generalisticki-model-koji-pomice-granice-racunalnog-vida\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2026-04-29T04:05:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Napisao\/la\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Procijenjeno vrijeme \u010ditanja\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minuta\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke 
novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2026-04-29T04:05:00+00:00\",\"dateModified\":\"2026-04-29T04:05:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\"},\"inLanguage\":\"hr\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"hr\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/hr\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","description":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/7527","og_locale":"hr_HR","og_type":"article","og_title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","og_description":"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/hr\/google-deepmind-predstavlja-generalisticki-model-koji-pomice-granice-racunalnog-vida\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2026-04-29T04:05:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"Napisao\/la":"sinusiks","Procijenjeno vrijeme \u010ditanja":"1 minuta"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","url":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2026-04-29T04:05:00+00:00","dateModified":"2026-04-29T04:05:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb"},"inLanguage":"hr","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"Google 
DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"hr"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/hr\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/7527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/comments?post=7527"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/posts\/7527\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/media?parent=7527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/categories?post=7527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/hr\/wp-json\/wp\/v2\/tags?post=7527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}