{"id":7527,"date":"2026-04-29T06:05:00","date_gmt":"2026-04-29T04:05:00","guid":{"rendered":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"},"modified":"2026-04-29T06:05:00","modified_gmt":"2026-04-29T04:05:00","slug":"google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/es\/google-deepmind-presenta-un-modelo-generalista-que-amplia-los-limites-de-la-vision-artificial\/","title":{"rendered":"Google DeepMind presents a generalist model that pushes the boundaries of computer vision"},"content":{"rendered":"<p>With its Vision Banana model, the Google DeepMind research team has shown that image-generation pretraining provides a strong foundation for general visual understanding, much as large language models (LLMs) develop language understanding through next-word prediction. The system builds on Nano Banana Pro, Google's most advanced image generator, which was turned into Vision Banana through lightweight instruction tuning. The key innovation is that a range of computer vision tasks, such as segmentation, depth estimation, and surface normal estimation, are recast as RGB image generation tasks.<br><br>Vision Banana achieved leading results in zero-shot settings, with no prior exposure to the specific datasets. It outperformed SAM 3 in image segmentation and, in metric depth estimation, scored 0.929 on the \u03b41 metric, ahead of the previous record holder, Depth Anything V3 (0.918). 
Most notably, the model needs no camera parameters to estimate depth, which had so far been a major obstacle for systems of this kind.<br><br>The approach offers three key advantages. First, a single model: one neural network can perform a wide range of tasks simply by changing the text prompt. Second, adapting the model required only a small amount of task-specific visual data. Third, despite its new analytical capabilities, Vision Banana fully retains its original ability to generate high-quality photorealistic images.<br><br>The researchers believe we are witnessing a paradigm shift in which generative pretraining will become the standard for building the general-purpose vision models of the future. Vision Banana is not just a new tool but evidence that the ability to create visual content implicitly requires a deep understanding of the geometry, semantics, and spatial relationships of the real world.<\/p>\n<div class=\"embed-container\"><iframe src=\"https:\/\/www.youtube.com\/embed\/I8VUN141MjU\" frameborder=\"0\" allowfullscreen><\/iframe><\/div><br\/>","protected":false},"excerpt":{"rendered":"<p>Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66],"tags":[126],"class_list":["post-7527","post","type-post","status-publish","format-standard","hentry","category-programi","tag-google"],"acf":{"subtitle":"Google DeepMind je razkril Vision Banana, revolucionaren model za generiranje slik, ki z uporabo u\u010denja na podlagi navodil dosega izjemne rezultate pri razumevanju vizualnih podatkov. Model je v testih premagal specializirane sisteme, kot sta SAM 3 pri segmentaciji slik in Depth Anything V3 pri ocenjevanju metri\u010dne globine, kar nakazuje na velik premik v razvoju umetne inteligence.","heading":"","summary":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. 
S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar dokazuje mo\u010d generativnega vida.","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-1024x768.jpg","thumbnail_caption":"Foto: Google","gallery":"","video_gallery":[{"youtube_url":"https:\/\/www.youtube.com\/watch?v=I8VUN141MjU"}],"author":"","links":[{"title":"Google ","url":""}],"sources":null,"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/posts\/7527\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/es\/google-deepmind-presenta-un-modelo-generalista-que-amplia-los-limites-de-la-vision-artificial\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2026-04-29T04:05:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minuto\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke 
novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2026-04-29T04:05:00+00:00\",\"dateModified\":\"2026-04-29T04:05:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"es\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/es\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","description":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/posts\/7527","og_locale":"es_ES","og_type":"article","og_title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","og_description":"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/es\/google-deepmind-presenta-un-modelo-generalista-que-amplia-los-limites-de-la-vision-artificial\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2026-04-29T04:05:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"sinusiks","Tiempo de lectura":"1 minuto"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","url":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2026-04-29T04:05:00+00:00","dateModified":"2026-04-29T04:05:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"Google DeepMind 
predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"es"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/es\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/posts\/7527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/comments?post=7527"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/posts\/7527\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/media?parent=7527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/categories?post=7527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/es\/wp-json\/wp\/v2\/tags?post=7527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}