How can you improve the performance of deep learning models when you have limited training data?

When training deep learning models, you typically need a large amount of training data to achieve high accuracy. But what happens when you have limited training data? How can you improve the model's performance without collecting more data? Are there any special techniques or methods you can use?

(6 votes)


1 Answer
Michael Förtsch
9 months ago

Many AI researchers and developers are currently wrestling with this issue, because it looks as though the high-quality data for training AI will soon run out: https://1e9.community/t/den-ki-firmen-wohl-bald-die-daten-aus/20172

But there are already some techniques that are being tested or even used in practice.

For example, images can be used more than once when training image models: an image is additionally mirrored, cropped to a different section, or slightly noised before being fed to the model again. Texts can be treated similarly: an AI paraphrases a text that has already been used, so one text suddenly becomes two.
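The image side of this can be sketched with plain NumPy. This is a minimal illustration of the three perturbations mentioned above (mirroring, cropping, light noise), not any specific library's augmentation pipeline:

```python
import numpy as np

def augment(img: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Return a randomly perturbed copy of an H x W x C uint8 image."""
    out = img.copy()
    if rng.random() < 0.5:                       # horizontal mirror
        out = out[:, ::-1, :]
    # random crop to 3/4 of the original size (a "different image section")
    h, w, _ = out.shape
    ch, cw = (3 * h) // 4, (3 * w) // 4
    top = rng.integers(0, h - ch + 1)
    left = rng.integers(0, w - cw + 1)
    out = out[top:top + ch, left:left + cw, :]
    # slight pixel noise ("roughening")
    noisy = out.astype(np.float64) + rng.normal(0, 5, out.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)

rng = np.random.default_rng(0)
img = rng.integers(0, 256, (64, 64, 3), dtype=np.uint8)  # stand-in for a real photo
variants = [augment(img, rng) for _ in range(3)]         # one image -> several samples
print(variants[0].shape)  # (48, 48, 3)
```

In a real training loop these transforms are applied on the fly each epoch, so the model never sees exactly the same pixels twice.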

In image models in particular, performance can also be improved by enriching the metadata of the training images. OpenAI did this when training DALL-E 3: it had an image captioner analyze the images in the dataset and generate much more detailed descriptions, covering not only the image content but also aesthetics, style, and mood.

You can read about it here: https://cdn.openai.com/papers/dall-e-3.pdf
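The recaptioning step can be sketched as follows. Everything here is illustrative: `caption_model` is a hypothetical stand-in for a real vision-language captioner, and the dataset layout is invented for the example:

```python
def caption_model(image_path: str) -> str:
    # Hypothetical captioner; a production system would call a
    # vision-language model on the actual image here.
    return ("A weathered red rowing boat on a calm lake at dawn, "
            "soft pastel light, painterly mood, centered composition")

def recaption(dataset: list[dict]) -> list[dict]:
    """Replace terse alt-text with detailed synthetic captions."""
    return [{**item, "caption": caption_model(item["path"])} for item in dataset]

dataset = [{"path": "boat.jpg", "caption": "boat"}]       # original sparse metadata
enriched = recaption(dataset)
print(enriched[0]["caption"])
```

The point is that the image-text pairs become far more informative without adding a single new image.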

In addition, special techniques are used to help models learn more from existing data. The startup Datology AI, for example, is researching so-called curriculum learning, in which data are presented to an AI in an order where the "learning content" builds on itself, so the AI benefits optimally. Quality counts instead of quantity: instead of 100 articles on a topic, the 10 best articles on that topic should be enough to achieve a better learning outcome.
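The core idea of curriculum learning can be sketched in a few lines: score each training example by difficulty and feed the easy ones first. The sentence data and the length-based difficulty proxy below are invented for illustration; real systems use model-based difficulty or quality scores:

```python
def curriculum_order(examples: list[str], difficulty) -> list[str]:
    """Return training examples sorted easiest-first."""
    return sorted(examples, key=difficulty)

sentences = [
    "The cat sat.",
    "Quantum chromodynamics describes the strong interaction between quarks.",
    "Dogs bark loudly.",
]
# Toy proxy: longer sentences count as "harder".
ordered = curriculum_order(sentences, difficulty=len)
print(ordered[0])  # "The cat sat."
```

The same scoring machinery supports the quality-over-quantity idea: rank examples by a quality score and keep only the top few instead of the whole pile.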

Researchers are also working on optimizing the architecture of the models themselves so that they learn more from the available data. After all, part of the reason such large amounts of data are needed lies in the models themselves. And there is progress here too: CM3Leon, an image-generation AI developed by Meta, is said to need not billions but "only" millions of images as training data to deliver robust results.

You can find more information here: https://ai.meta.com/blog/generative-ai-text-images-cm3leon/