{"id":696,"date":"2023-09-26T17:48:00","date_gmt":"2023-09-26T15:48:00","guid":{"rendered":"https:\/\/vivalainfo.com\/en\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/"},"modified":"2023-09-26T17:48:00","modified_gmt":"2023-09-26T15:48:00","slug":"chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze","status":"publish","type":"post","link":"https:\/\/viva.racunalniske-novice.com\/en\/chatgpt-now-also-understands-images-and-voice-commands\/","title":{"rendered":"ChatGPT now also understands images and voice commands"},"content":{"rendered":"<p class=\"wp-block-paragraph\">OpenAI is constantly improving the ChatGPT chatbot. The new version lets users interact with ChatGPT using voice and images as well, which raises new questions and concerns. So what does the new version bring, and when?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most of the changes OpenAI is making to ChatGPT relate to what the AI-powered bot can do: what questions it can answer, what information it can access, and so on. This time, however, the company is also changing how you use ChatGPT itself. It is introducing a new version of the service that lets you interact with the AI bot not just by typing sentences into a text box, but also by talking to it or simply uploading an image. The new features will be available to Plus subscribers in the coming weeks, with everyone else getting the new functionality \u201csoon after.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The voice feature isn&#039;t anything groundbreaking: you tap a button and ask your question; ChatGPT converts it to text, feeds it to a large language model, converts the answer back to speech, and reads it aloud. It should be similar to talking to Alexa or Google Assistant, except that, OpenAI hopes, the answers will be better thanks to the improved underlying technology. 
Most virtual assistants are being revamped to rely on large language models, and OpenAI is a step ahead of them.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">OpenAI\u2019s excellent Whisper model handles much of the speech-to-text conversion, and the company is also introducing a new text-to-speech model that it says can create \u201chuman-like sound, just from text and a few seconds of sample speech.\u201d You\u2019ll be able to choose a voice for ChatGPT from five options, but OpenAI seems to think the model has much more potential. For example, OpenAI is working with Spotify to translate podcasts into other languages while preserving the host\u2019s voice. There are many interesting uses for synthetic voices, and OpenAI could be a big part of that industry.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Still, the fact that you can create a decent synthetic voice from just a few seconds of recorded audio opens the door to all sorts of potentially problematic use cases. \u201cThese capabilities introduce new threats, such as the possibility of malicious actors impersonating public figures and the like,\u201d the company wrote in a blog post announcing the new features. That\u2019s why the model isn\u2019t available for general use: it will be much more tightly controlled and limited to specific use cases and partnerships.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The image search feature is somewhat similar to Google Lens. You snap a photo, and ChatGPT tries to work out what you&#039;re asking and responds accordingly. You can also use the drawing tool in the app to make the question as clear as possible, or speak or type questions related to the picture. This is where the conversational nature of ChatGPT comes in particularly handy: instead of running a search, getting the wrong answer, and then running a new search, you can nudge the bot and improve the answer as you go. 
This is very similar to what Google is doing with multimodal search.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Of course, adding image input to ChatGPT also has its drawbacks. One arises when you point ChatGPT at a person: OpenAI says it has deliberately limited \u201cChatGPT\u2019s ability to analyze and make direct statements about people,\u201d both for accuracy and for privacy. This means that one of the most sci-fi visions of artificial intelligence, the ability to look at someone and tell who they are, is not going to be realized anytime soon, which is probably a good thing.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Almost a year after ChatGPT&#039;s launch, it seems that OpenAI is still trying to figure out how to give its model more features and capabilities without creating new problems and downsides. With new releases, the company has tried to walk that fine line by deliberately limiting what its new models can do. But that approach will not always work. As more and more people use voice control and image search, and as ChatGPT moves closer to becoming a truly multimodal, useful virtual assistant, it will become increasingly difficult to maintain all of these safeguards.<\/p>","protected":false},"excerpt":{"rendered":"<p>OpenAI is constantly improving the ChatGPT chatbot. The new version lets users interact with ChatGPT using voice and images as well, which raises new questions and concerns. So what does the new version bring, and when? 
Most of the changes OpenAI is making to ChatGPT relate to what the AI-powered bot [&hellip;]<\/p>","protected":false},"author":2,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[66],"tags":[148,192],"class_list":["post-696","post","type-post","status-publish","format-standard","hentry","category-programi","tag-chatgpt","tag-umetna-inteligenca"],"acf":{"subtitle":"","heading":"","summary":"OpenAI is constantly improving ChatGPT. The new version lets users interact with ChatGPT using voice and images as well, but this also raises new questions and concerns. So what does the new version bring, and when?","thumbnail_small":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2023\/09\/ilgmyzin-agFmImWyPso-unsplash-560x315.jpg","thumbnail_large":"https:\/\/racunalniske-novice.com\/wp-content\/uploads\/2023\/09\/ilgmyzin-agFmImWyPso-unsplash-1024x576.jpg","thumbnail_caption":"","gallery":"","video_gallery":null,"author":"","links":null,"sources":[{"title":"The Verge","url":"https:\/\/www.theverge.com\/2023\/9\/25\/23886699\/chatgpt-pictures-voice-commands-ai-chatbot-openai"},{"title":"Unsplash","url":"https:\/\/unsplash.com\/photos\/agFmImWyPso"}],"skip_language":[]},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice<\/title>\n<meta name=\"description\" content=\"OpenAI nenehno izbolj\u0161uje ChatGPT. Nova razli\u010dica omogo\u010da uporabnikom, da ChatGPT aktivirajo tudi z glasom in slikami, a s tem se pojavljajo tudi nova vpra\u0161anja in skrbi. 
Kaj torej prina\u0161a nova razli\u010d\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/posts\/696\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"og:description\" content=\"Podjetje OpenAI nenehno izbolj\u0161uje klepetalnega robota ChatGPT. Nova razli\u010dica uporabnikom omogo\u010da, da ChatGPT aktivirajo tudi z glasom in slikami, s tem pa se pojavljajo tudi nova vpra\u0161anja in skrbi. Kaj torej prina\u0161a nova razli\u010dica in kdaj? Ve\u010dina sprememb, ki jih OpenAI uvaja v ChatGPT, se nana\u0161a na to, kaj bot, ki ga poganja umetna inteligenca, [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/viva.racunalniske-novice.com\/en\/chatgpt-now-also-understands-images-and-voice-commands\/\" \/>\n<meta property=\"og:site_name\" content=\"Ra\u010dunalni\u0161ke novice\" \/>\n<meta property=\"article:published_time\" content=\"2023-09-26T15:48:00+00:00\" \/>\n<meta name=\"author\" content=\"sinusiks\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"sinusiks\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/\",\"name\":\"ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice\",\"isPartOf\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\"},\"datePublished\":\"2023-09-26T15:48:00+00:00\",\"dateModified\":\"2023-09-26T15:48:00+00:00\",\"author\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\"},\"breadcrumb\":{\"@id\":\"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/viva.racunalniske-novice.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"ChatGPT odslej razume tudi slike in glasovne ukaze\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#website\",\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/\",\"name\":\"Ra\u010dunalni\u0161ke novice\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4\",\"name\":\"sinusiks\",\"sameAs\":[\"https:\/\/ml.racunalniske-novice.com\"],\"url\":\"https:\/\/viva.racunalniske-novice.com\/en\/author\/sinusiks\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice","description":"OpenAI nenehno izbolj\u0161uje ChatGPT. Nova razli\u010dica omogo\u010da uporabnikom, da ChatGPT aktivirajo tudi z glasom in slikami, a s tem se pojavljajo tudi nova vpra\u0161anja in skrbi. Kaj torej prina\u0161a nova razli\u010d","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/posts\/696","og_locale":"en_US","og_type":"article","og_title":"ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice","og_description":"Podjetje OpenAI nenehno izbolj\u0161uje klepetalnega robota ChatGPT. Nova razli\u010dica uporabnikom omogo\u010da, da ChatGPT aktivirajo tudi z glasom in slikami, s tem pa se pojavljajo tudi nova vpra\u0161anja in skrbi. Kaj torej prina\u0161a nova razli\u010dica in kdaj? Ve\u010dina sprememb, ki jih OpenAI uvaja v ChatGPT, se nana\u0161a na to, kaj bot, ki ga poganja umetna inteligenca, [&hellip;]","og_url":"https:\/\/viva.racunalniske-novice.com\/en\/chatgpt-now-also-understands-images-and-voice-commands\/","og_site_name":"Ra\u010dunalni\u0161ke novice","article_published_time":"2023-09-26T15:48:00+00:00","author":"sinusiks","twitter_card":"summary_large_image","twitter_misc":{"Written by":"sinusiks","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/","url":"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/","name":"ChatGPT odslej razume tudi slike in glasovne ukaze - Ra\u010dunalni\u0161ke novice","isPartOf":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website"},"datePublished":"2023-09-26T15:48:00+00:00","dateModified":"2023-09-26T15:48:00+00:00","author":{"@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4"},"breadcrumb":{"@id":"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/viva.racunalniske-novice.com\/chatgpt-odslej-razume-tudi-slike-in-glasovne-ukaze\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/viva.racunalniske-novice.com\/en\/"},{"@type":"ListItem","position":2,"name":"ChatGPT odslej razume tudi slike in glasovne ukaze"}]},{"@type":"WebSite","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#website","url":"https:\/\/viva.racunalniske-novice.com\/en\/","name":"Ra\u010dunalni\u0161ke novice","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/viva.racunalniske-novice.com\/en\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/viva.racunalniske-novice.com\/en\/#\/schema\/person\/afb62e36efa34516d50249517e4cdbb4","name":"sinusiks","sameAs":["https:\/\/ml.racunalniske-novice.com"],"url":"https:\/\/viva.racunalniske-novice.com\/en\/author\/sinusiks\/"}]}},"_links":{"self":[{"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/posts\/696","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/comments?post=696"}],"version-history":[{"count":0,"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/posts\/696\/revisions"}],"wp:attachment":[{"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/media?parent=696"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/categories?post=696"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/viva.racunalniske-novice.com\/en\/wp-json\/wp\/v2\/tags?post=696"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}