{"id":7527,"date":"2026-04-29T06:05:00","date_gmt":"2026-04-29T04:05:00","guid":{"rendered":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"},"modified":"2026-04-29T06:05:00","modified_gmt":"2026-04-29T04:05:00","slug":"google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/de\/google-deepmind-prasentiert-ein-generalistisches-modell-das-die-grenzen-der-computer-vision-erweitert\/","title":{"rendered":"Google DeepMind presents a generalist model that pushes the boundaries of computer vision."},"content":{"rendered":"<p>With its Vision Banana model, the Google DeepMind research team has shown that image-generation pretraining provides a solid foundation for general understanding of the visual world, much as large language models (LLMs) develop language understanding by predicting the next word. The system is based on Nano Banana Pro, Google's most advanced image generator, which was turned into Vision Banana through lightweight instruction tuning. The central innovation is that various computer vision tasks, such as segmentation, depth estimation, and surface normal estimation, are recast as RGB image generation tasks.<br><br>Vision Banana achieved outstanding results in zero-shot settings, in which the model has no prior exposure to the specific datasets. It outperformed SAM 3 in image segmentation and reached a \u03b41 score of 0.929 in metric depth estimation, surpassing the previous record holder, Depth Anything V3 (0.918). 
What is particularly impressive is that the model needs no information about camera parameters for depth estimation, which has so far been a major obstacle for such systems.<br><br>This approach offers three key advantages. First, it is a single model: one neural network handles a wide range of tasks, and only the text prompt changes. Second, adapting the model required only a small amount of task-specific visual data. Third, despite its new analysis capabilities, Vision Banana retains its original ability to generate excellent photorealistic images.<br><br>The researchers believe we are witnessing a paradigm shift in which generative pretraining becomes the standard for building the general-purpose vision models of the future. Vision Banana is not just a new tool but also evidence that the ability to create visual content implicitly requires a deep understanding of geometry, semantics, and spatial relationships in the real world.<\/p>\n<div class=\"embed-container\"><iframe src=\"https:\/\/www.youtube.com\/embed\/I8VUN141MjU\" frameborder=\"0\" allowfullscreen><\/iframe><\/div><br\/>","protected":false},"excerpt":{"rendered":"<p>Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66],"tags":[126],"class_list":["post-7527","post","type-post","status-publish","format-standard","hentry","category-programi","tag-google"],"acf":{"subtitle":"Google DeepMind je razkril Vision Banana, revolucionaren model za generiranje slik, ki z uporabo u\u010denja na podlagi navodil dosega izjemne rezultate pri razumevanju vizualnih podatkov. Model je v testih premagal specializirane sisteme, kot sta SAM 3 pri segmentaciji slik in Depth Anything V3 pri ocenjevanju metri\u010dne globine, kar nakazuje na velik premik v razvoju umetne inteligence.","heading":"","summary":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. 
S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar dokazuje mo\u010d generativnega vida.","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2026\/04\/Gemini-On-Mac-1024x768.jpg","thumbnail_caption":"Foto: Google","gallery":"","video_gallery":[{"youtube_url":"https:\/\/www.youtube.com\/watch?v=I8VUN141MjU"}],"author":"","links":[{"title":"Google ","url":""}],"sources":null,"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/posts\/7527\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/de\/google-deepmind-prasentiert-ein-generalistisches-modell-das-die-grenzen-der-computer-vision-erweitert\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2026-04-29T04:05:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"1\u00a0Minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\",\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke 
novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2026-04-29T04:05:00+00:00\",\"dateModified\":\"2026-04-29T04:05:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"de\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/de\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","description":"Google DeepMind je predstavil Vision Banana, model, ki z generiranjem slik re\u0161uje kompleksne vizualne naloge. S svojo zmogljivostjo je prehitel specializirana orodja SAM 3 in Depth Anything V3, kar do","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/posts\/7527","og_locale":"de_DE","og_type":"article","og_title":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke novice","og_description":"Raziskovalna ekipa Google DeepMind je z modelom Vision Banana dokazala, da predhodniki za generiranje slik slu\u017eijo kot mo\u010dni temelji za splo\u0161no razumevanje vizualnega sveta, podobno kot veliki jezikovni modeli (LLM) razvijejo razumevanje jezika skozi napovedovanje naslednje besede. 
Osnova sistema je Nano Banana Pro, Googlov najnaprednej\u0161i generator slik, ki so ga s pomo\u010djo lahkotnega u\u010denja na [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/de\/google-deepmind-prasentiert-ein-generalistisches-modell-das-die-grenzen-der-computer-vision-erweitert\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2026-04-29T04:05:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"Verfasst von":"sinusiks","Gesch\u00e4tzte Lesezeit":"1\u00a0Minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","url":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/","name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida - Ra\u010dunalni\u0161ke 
novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2026-04-29T04:05:00+00:00","dateModified":"2026-04-29T04:05:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/google-deepmind-predstavlja-generalisticni-model-ki-premika-meje-racunalniskega-vida\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"Google DeepMind predstavlja generalisti\u010dni model, ki premika meje ra\u010dunalni\u0161kega vida"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"de"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/de\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/posts\/7527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/comments?post=7527"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/posts\/7527\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/media?parent=7527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/categories?post=7527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/de\/wp-json\/wp\/v2\/tags?post=7527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}