Mindtech Launches New Series on Synthetic Data – The Go-To Guide for Anyone Training AI to See and Understand Our World
Guide Provides Industry Analysis on Synthetic Data, Practical Tips for Data Engineers and Real-World Use Cases For Data Training
Part One Reveals How Synthetic Data Resolves Visual AI’s Training Problems
Mindtech Global, developer of the world’s leading platform for the creation of synthetic data for training AI, has launched its first guide on how to use synthetic data to resolve visual AI’s training problems.
From retail to law enforcement, and from healthcare to driverless cars, data scientists the world over are developing powerful visual AI applications that are bringing the benefits of deep machine learning networks to a whole swathe of industries.
However, trouble is stalking visual AI’s brave new world. A clutch of real-world data acquisition problems – collectively amounting to what’s being called a data roadblock – is holding up the advancement of visual AI.
The answer to this data roadblock is a relatively simple one: visual AI developers need to augment whatever real-world data they can acquire with as much synthetic data as they can generate.
Using Chameleon, Mindtech’s synthetic data creation platform, users set up a scene of buildings and environments, and then import all the assets relevant to their application – which could be anything: people, bicycles, cars or crowds in which people mill in multiple directions (with collision detection). They then set up activities, events and “what if” scenarios that will generate images to be captured by one or more virtual cameras in a series of simulation runs – the images that will ultimately form the basis of the data used to train a user’s AI.
Training data created this way arrives perfectly annotated, privacy-compliant and ready for use by machine learning engineers and data scientists, with no need for 3D graphics expertise on the part of the user.
Chris Longstaff, VP Product Management, Mindtech Global, said:
“AI models are infamous for fragility – throwing up bizarre, unexpected results due to the fact that they sometimes generalize from incomplete datasets, or a fault with the model design. For that reason, a synthetic data platform must be capable – as Chameleon is – of reproducing a dataset it once generated at a later time, should anyone need to forensically check why an ML model in development needs troubleshooting.
That key error checking capability ensures those tasked with training AI models can have as much faith in synthetic data as they currently do in real-world data – perhaps even more.”
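To illustrate the reproducibility point in general terms, here is a minimal Python sketch of seeded dataset generation. It is not Chameleon's API, which the guide announcement does not describe. The names `LabelledObject`, `generate_frame` and `generate_dataset` are hypothetical, and random bounding boxes stand in for a real simulation run. The idea shown is only that a generator driven by a recorded seed can recreate the same annotated dataset later for troubleshooting.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelledObject:
    """One annotated object in a frame (hypothetical structure)."""
    category: str
    x: int
    y: int
    width: int
    height: int


def generate_frame(rng, categories, image_size=(640, 480), max_objects=5):
    """Place a random number of labelled boxes fully inside the image."""
    objects = []
    for _ in range(rng.randint(1, max_objects)):
        w = rng.randint(20, 100)
        h = rng.randint(20, 100)
        x = rng.randint(0, image_size[0] - w)
        y = rng.randint(0, image_size[1] - h)
        objects.append(LabelledObject(rng.choice(categories), x, y, w, h))
    return objects


def generate_dataset(seed, n_frames, categories=("person", "bicycle", "car")):
    """Build a dataset from a private, seeded random generator.

    Using a dedicated random.Random(seed) instead of the global module
    state means the same seed always yields the same frames.
    """
    rng = random.Random(seed)
    return [generate_frame(rng, categories) for _ in range(n_frames)]


if __name__ == "__main__":
    original = generate_dataset(seed=42, n_frames=10)
    regenerated = generate_dataset(seed=42, n_frames=10)
    print("identical on regeneration:", original == regenerated)
```

Storing the seed and generation parameters alongside a trained model would, under these assumptions, let an engineer regenerate the exact training frames when investigating an unexpected result.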