Why you need to consider using small data to train AI models
AI models are only as good as their training data.
Some models benefit from large amounts of data. A good example is OpenAI’s Dall-E 2, which was trained on huge volumes of data to generate images from text descriptions. Other models do not require much data and would not benefit from more.
The idea is gaining ground that small data is just as essential to AI systems and technologies as big data. A 2021 Scientific American article by Georgetown University researchers reported that one approach to small data is to first train a model on big data and then retrain it on a smaller data set. This is known as fine-tuning.
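As a rough illustration of the fine-tuning idea described above, the following Python sketch pretrains a simple linear model on a large synthetic data set and then continues training on just 20 examples from a related task. Everything in it is an assumption made for illustration: the data is randomly generated, the `train` helper and all variable names are hypothetical, and a linear model stands in for the far larger models used in practice. It is not drawn from the Scientific American article or from Eigen Technologies.

```python
import numpy as np


def train(X, y, w_start, lr, steps):
    """Gradient descent on mean squared error, starting from w_start."""
    w = w_start.copy()
    for _ in range(steps):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w


rng = np.random.default_rng(0)

# Big data stage: 10,000 synthetic examples from a general task.
w_general = np.array([2.0, -1.0, 0.5])
X_big = rng.normal(size=(10_000, 3))
y_big = X_big @ w_general + rng.normal(scale=0.1, size=10_000)
w_pretrained = train(X_big, y_big, np.zeros(3), lr=0.1, steps=200)

# Small data stage: 20 synthetic examples from a related, slightly different task.
w_specific = np.array([2.2, -0.8, 0.5])
X_small = rng.normal(size=(20, 3))
y_small = X_small @ w_specific + rng.normal(scale=0.1, size=20)

# Fine-tuning: a handful of steps on the small set, starting from the pretrained weights.
w_finetuned = train(X_small, y_small, w_pretrained, lr=0.05, steps=5)

# Baseline: the same handful of steps on the small set, starting from zero.
w_scratch = train(X_small, y_small, np.zeros(3), lr=0.05, steps=5)

# Distance of each result from the weights that generated the small data set.
err_finetuned = np.linalg.norm(w_finetuned - w_specific)
err_scratch = np.linalg.norm(w_scratch - w_specific)
print("fine-tuned error:  ", err_finetuned)
print("from-scratch error:", err_scratch)
```

In this setup, the fine-tuned weights end up closer to the task-specific target than the weights trained from scratch on the same small budget, because they start from a model that already captures most of the structure.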
While there are areas in which big data is needed, such as autonomous vehicles, many other AI applications can function with a small amount of data, according to Lewis Z. Liu, co-founder and CEO of Eigen Technologies, a New York-based startup whose AI platform enables enterprises to extract data from documents.
In this Q&A, Liu discusses when small data is preferable to big data and how to make small data relevant.
What’s the benefit of small data AI?
Lewis Z. Liu: If you’re small, you have much more control. So you can be conscious about what kind of bias or nonbias [is present]. It’s more about conscious bias versus unconscious bias.
When is small data preferable to big data for an AI model or system?
Liu: I would argue that in the case of intelligent document processing, you want to use small data AI.
On one hand, you have what I call high-bar, low-marginal-value documents. By low marginal value, I mean easy to automate — things like passports, driver’s licenses, W-2 tax forms. Those things are simple and really high volume — most Americans have a W-2 form, right? Half of Americans have passports. Those are easy. Generally, you’ll use the big data approach because you have high volume.
But if you look at most invoice processes, your finance department wants to process their invoices, but they may only have 1,000 invoices a year. If you are a Wall Street trader and you’re trading some exotic derivative, they may only issue 200 derivatives. Or you’re an insurance broker that insures residential property, and your brokerage firm may only get 1,000 of these property documents a year.
There are many more use cases and many more document types that are really high value because you’re a lawyer or you’re a banker or you’re an insurance broker looking at these documents, but the document volumes are low per use case. So you actually need small data AI to tackle all these use cases. Furthermore, generally, the people looking at these documents are highly paid. Therefore, you actually get what I like to call ‘lower volume, higher value.’
What happens when small data AI is not enough?
Liu: The data and the documents are just one part of the broader story in the business operation. Sometimes that’s all you need. For some cases, you need to combine the data you get from documents and from other sources. For example, you’re buying a house — you need to look at the title insurance, you need to look at the land grant deed, and you need to look at the homeowner policy. You need to collect data from all of these sources, but you also need to collect data from bank accounts and all those things which are not from documents.
What’s the direction of big data versus small data in AI?
Liu: This is highly use case specific. Big data sets are the future. You need a lot of data to train a self-driving car. There’s no way you can use small data for that. However, in a lot of enterprise applications like intelligent document processing or automated insurance underwriting — where there’s a lot of these use cases, but they’re all very specific — small data is the way to go.
If big data is the future, how can small data AI remain relevant?
Liu: Big data AI is useful for a lot of applications, not all applications.
A human being is versatile, and the whole reason why human beings are so smart is the fact that we are sort of small data machines. We can learn from one or two examples, and then we can do it. If I show you a dance move twice, you can probably do that dance. That flexibility is what makes a human being so versatile in the workplace.
Using small data AI, you have one or two or three training examples, and you can train the AI to do a certain task. It’s that flexibility that makes human beings shine. The future of AI is that some AI systems have that versatility and can shine in that way.
Editor’s note: This Q&A has been edited for clarity and conciseness.