{"id":7527,"date":"2026-04-29T06:05:00","date_gmt":"2026-04-29T04:05:00","guid":{"rendered":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"},"modified":"2026-04-29T06:05:00","modified_gmt":"2026-04-29T04:05:00","slug":"google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/fr\/google-deepmind-presente-un-modele-generaliste-qui-repousse-les-limites-de-la-vision-par-ordinateur\/","title":{"rendered":"Google DeepMind pr\u00e9sente un mod\u00e8le g\u00e9n\u00e9raliste qui repousse les limites de la vision par ordinateur"},"content":{"rendered":"<p>L&#039;\u00e9quipe de recherche Google DeepMind a d\u00e9montr\u00e9, gr\u00e2ce au mod\u00e8le Vision Banana, que les pr\u00e9curseurs de la g\u00e9n\u00e9ration d&#039;images constituent une base solide pour la compr\u00e9hension g\u00e9n\u00e9rale du monde visuel, \u00e0 l&#039;instar des grands mod\u00e8les de langage (LLM) qui d\u00e9veloppent la compr\u00e9hension du langage par la pr\u00e9diction du mot suivant. Ce syst\u00e8me repose sur Nano Banana Pro, le g\u00e9n\u00e9rateur d&#039;images le plus avanc\u00e9 de Google, transform\u00e9 en Vision Banana gr\u00e2ce \u00e0 un apprentissage l\u00e9ger bas\u00e9 sur des instructions. L&#039;innovation majeure r\u00e9side dans la transformation de diverses t\u00e2ches de vision par ordinateur, telles que la segmentation, la d\u00e9termination de la profondeur et l&#039;estimation des normales de surface, en t\u00e2ches de g\u00e9n\u00e9ration d&#039;images RGB.<br><br>Vision Banana a obtenu des r\u00e9sultats exceptionnels dans des environnements \u00ab z\u00e9ro-shot \u00bb, o\u00f9 le mod\u00e8le ne dispose d&#039;aucune exp\u00e9rience pr\u00e9alable avec des jeux de donn\u00e9es sp\u00e9cifiques. Il a surpass\u00e9 le mod\u00e8le SAM 3 en segmentation d&#039;images, tout en atteignant un score de profondeur de 0,929 (param\u00e8tre \u03b41), battant ainsi le pr\u00e9c\u00e9dent record d\u00e9tenu par Depth Anything V3 (0,918). Plus impressionnant encore, le mod\u00e8le ne n\u00e9cessite aucune information sur les param\u00e8tres de la cam\u00e9ra pour d\u00e9terminer la profondeur, ce qui constituait jusqu&#039;\u00e0 pr\u00e9sent un obstacle majeur pour ce type de syst\u00e8mes.<br><br>Cette approche pr\u00e9sente trois avantages cl\u00e9s\u00a0: un mod\u00e8le unique o\u00f9 un seul r\u00e9seau neuronal peut r\u00e9aliser un large \u00e9ventail de t\u00e2ches, seul le texte d\u2019invite changeant\u00a0; une quantit\u00e9 r\u00e9duite de donn\u00e9es visuelles sp\u00e9cifiques a \u00e9t\u00e9 n\u00e9cessaire pour adapter le mod\u00e8le\u00a0; et enfin, malgr\u00e9 ses nouvelles capacit\u00e9s d\u2019analyse, Vision Banana conserve pleinement sa fonction premi\u00e8re de g\u00e9n\u00e9ration d\u2019images photor\u00e9alistes exceptionnelles.<br><br>Les chercheurs estiment que nous assistons \u00e0 un changement de paradigme\u00a0: l\u2019apprentissage g\u00e9n\u00e9ratif pr\u00e9alable deviendra la norme pour la construction de mod\u00e8les visuels g\u00e9n\u00e9raux du futur. Vision Banana n\u2019est pas qu\u2019un simple outil\u00a0; il d\u00e9montre que la capacit\u00e9 \u00e0 cr\u00e9er du contenu visuel requiert implicitement une compr\u00e9hension approfondie de la g\u00e9om\u00e9trie, de la s\u00e9mantique et des relations spatiales du monde r\u00e9el.<\/p>\n<div class=\"embed-container\"><iframe src=\"https:\/\/www.youtube.com\/embed\/I8VUN141MjU\" frameborder=\"0\" allowfullscreen><\/iframe><\/div><br\/>","protected":false},"excerpt":{"rendered":"<p>Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66],"tags":[126],"class_list":["post-7527","post","type-post","status-publish","format-standard","hentry","category-programi","tag-google"],"acf":{"subtitle":"Google DeepMind je razkril Vision Banana, revolucionaren model za generiranje slik, ki z uporabo u\u010denja na podlagi navodil dosega izjemne rezultate pri razumevanju vizualnih podatkov. Model je v testih premagal specializirane sisteme, kot sta SAM 3 pri segmentaciji slik in Depth Anything V3 pri ocenjevanju metri\u010dne globine, kar nakazuje na velik premik v razvoju umetne inteligence.","heading":"","summary":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar dokazuje mo\u010d generativnega vida.","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-1024x768.jpg","thumbnail_caption":"Foto: Google","gallery":"","video_gallery":[{"youtube_url":"https:\/\/www.youtube.com\/watch?v=I8VUN141MjU"}],"author":"","links":[{"title":"Google ","url":""}],"sources":null,"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/posts\/7527\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/fr\/google-deepmind-presente-un-modele-generaliste-qui-repousse-les-limites-de-la-vision-par-ordinateur\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2026-04-29T04:05:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2026-04-29T04:05:00+00:00\",\"dateModified\":\"2026-04-29T04:05:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/fr\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","description":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/posts\/7527","og_locale":"fr_FR","og_type":"article","og_title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","og_description":"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/fr\/google-deepmind-presente-un-modele-generaliste-qui-repousse-les-limites-de-la-vision-par-ordinateur\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2026-04-29T04:05:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"\u00c9crit par":"sinusiks","Dur\u00e9e de lecture estim\u00e9e":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","url":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2026-04-29T04:05:00+00:00","dateModified":"2026-04-29T04:05:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"fr-FR"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/fr\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/posts\/7527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/comments?post=7527"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/posts\/7527\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/media?parent=7527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/categories?post=7527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/fr\/wp-json\/wp\/v2\/tags?post=7527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}