GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIOpenAITech

OpenAI’s GPT-4 now understands both text and image inputs

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Mar 15, 2023, 8:47 AM EDT
Share
We may get a commission from retail offers. Learn more
OpenAI's GPT-4 now understands both text and image inputs
(Photo by D koi on Unsplash)
SHARE

OpenAI has announced the launch of GPT-4, the latest iteration of its generative pre-trained transformer system. Unlike its predecessor GPT-3.5, which can only read and respond with text, GPT-4 can generate text on input images. This development comes hot on the heels of Google’s Workspace AI announcement and ahead of Microsoft’s Future of Work event. OpenAI has reportedly spent the past six months refining the system’s performance based on user feedback generated from the recent ChatGPT conversational bot hype. The company claims that GPT-4 exhibits human-level performance on various professional and academic benchmarks.

OpenAI has partnered with Microsoft to develop GPT’s capabilities and has achieved record performance in “factuality, steerability, and refusing to go outside of guardrails” compared to its predecessor. The new system has also outperformed other state-of-the-art large language models (LLMs) in a variety of benchmark tests. GPT-4 will be made available for both ChatGPT and the API, but access will be restricted to ChatGPT Plus subscribers and API waitlist users, respectively. There will also be a usage cap in place for playing with the new model.

The added multi-modal input feature of GPT-4 will generate text outputs based on a wide variety of mixed text and image inputs. This means that users can scan marketing and sales reports, textbooks, shop manuals, and even screenshots, and ChatGPT will summarize the various details into small words that are easy to understand. The recently upgraded system can be customized by the API developer, allowing developers and soon ChatGPT users to prescribe their AI’s style and task by describing those directions in the ‘system’ message.

GPT-4 has been tested by 50 experts in a wide array of professional fields, and the model’s tendency to “hallucinate” facts has been reduced by around 40 percent compared to its predecessor. The new model is also 82 percent less likely to respond to requests for disallowed content. However, OpenAI still strongly recommends that great care should be taken when using language model outputs, particularly in high-stakes contexts, and that the exact protocol should match the needs of a specific use case.

OpenAI’s GPT-4 represents a significant advance in AI technology, with its ability to generate text on input images and improved performance in various benchmarks. As AI continues to develop, it will be interesting to see how GPT-4 and other systems like it can be used in practical applications.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Leave a Comment

Leave a ReplyCancel reply

Most Popular

Perplexity Computer adds a Command Panel

Summer Sale gives Nothing’s lineup a more tempting price tag

Also Read
Collage of four web-based artifacts created with Claude Code, including an analytics dashboard, a mobile app design showcase, a software migration report, and a systems workflow visualization. The examples demonstrate interactive interfaces, data-rich dashboards, design systems, and technical documentation generated through AI-assisted development.

Live artifacts come to Claude Code

Illustration of a Claude Connectors settings panel with organization-wide access enabled. A large toggle switch labeled “Enable for organization” is turned on, and a hand-shaped cursor points to it. Below, a list of connected apps—Asana, Atlassian, Canva, Figma, and Granola—each displays an enabled blue toggle switch. The interface appears on a light gray background with a clean, minimalist design.

Claude just solved the enterprise AI authorization headache — and it only took one login

Abstract 3D visualization of a connected network represented as a dark globe covered with intersecting lines and glowing spherical nodes. The illuminated points appear linked across the curved surface, symbolizing artificial intelligence, neural networks, global data connections, and knowledge processing.

Perplexity launches Brain for its Computer agent

Simple illustration of a shopping bag with a keyhole symbol on the front, representing secure or private shopping, on a solid orange background.

Anthropic killed the API key (for workloads, at least)

Design editor interface displaying a crowdfunding webpage for Maple Grove Park alongside a Claude Code terminal window. The design canvas shows editable text, fundraising progress, and donation information, while Claude Code is used to synchronize design components between the visual editor and development workflow.

Claude Design adds admin controls, direct editing, and a connector army

Abstract promotional graphic for LifeSciBench featuring layered design elements on a soft blue gradient background with light reflections and blurred yellow highlights. The composition includes a pale yellow rectangle, a scientific-style bar chart with error bars, and a large cropped text block reading “LifeSciBench” in bold black lettering on a light blue panel. The clean, modern layout combines data visualization and branding elements to represent a life sciences benchmarking or evaluation platform.

OpenAI’s GPT-Rosalind leads LifeSciBench — at a 36% pass rate

Abstract science-themed graphic featuring a soft green and blue gradient background with layered geometric shapes. A chemical structure diagram labeled “4-hydroxy-TEMPO” appears in the upper-right section, while large cropped black typography partially displays the letters “Mo.” The composition combines molecular chemistry imagery with modern design elements, suggesting a scientific research, chemistry, or drug discovery platform.

OpenAI’s near-autonomous chemist just proved it can do real wet-lab science

Apple iCloud logo displayed on a blue gradient background. The image features the iCloud cloud icon centered above the “iCloud” wordmark in white, representing Apple’s cloud storage and synchronization service used for backing up data, syncing files, photos, documents, and settings across iPhone, iPad, Mac, Apple Watch, and other Apple devices.

Apple’s new private.icloud.com domain has a downside

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.