pantheonuk
  • Home
  • Business
  • Education
  • Fashion
  • Health
  • Lifestyle
  • News
  • Tech
  • Sports
  • Travel
No Result
View All Result
Pantheonuk.org
  • Home
  • Business
  • Education
  • Fashion
  • Health
  • Lifestyle
  • News
  • Tech
  • Sports
  • Travel
No Result
View All Result
Pantheonuk.org
No Result
View All Result

The Looming Problem of Computation Capacity for Deep Learning Algorithms

Joe Calvin by Joe Calvin
July 30, 2024
in Tech
0
deep learning algorithm
Share on FacebookShare on Twitter

Deep Learning Algorithms

Deep learning is a subset of machine learning algorithms that uses multiple layers of artificial neural networks (ANN) to extract latent features from the input data, eliminating the necessity of manual feature engineering. ANNs are the mathematical model of our human brain.

Deep learning models are popular because they can achieve optimal performance given enough training data and compute power. Also, the growing number of frameworks for model development and deployment has made it easier for any IT company to integrate deep learning models in their businesses. Spell is a popular deep learning model serving platform that makes model deployment and maintenance easier for busy engineers.

But what will happen when we can’t make any more improvements in terms of computation power? This has become a looming problem for deep learning algorithms. To make the context clear, let’s start by looking at some of the recent state-of-the-art models and the resources dedicated to training some of the most comprehensive deep learning models.

  • BERT

BERT stands for Bidirectional Encoder Representation from Transformers. It was developed by Google researchers and took the natural language processing (NLP) community by storm. It was able to obtain high-quality results in a wide range of NLP tasks like question answering, natural language inference, sentiment classification, etc. BERT is a huge model with two variants. BERT large has around 345 million parameters, whereas BERT base has around 110 million parameters. The BERT base was trained on four cloud TPUs for four days, whereas the BERT large was trained on 16 cloud TPUs for four days. The total price of one-time pre-training for the BERT large model can be estimated to be around $7,000.

  • GPT 2

GPT-2 is an extensive language model built by OpenAI recently that can generate realistic text paragraphs. Despite the lack of task-specific training data, the model performs admirably in various linguistic tasks, including machine translation, question answering, reading comprehension, and summarization. GPT has around 1.5 billion parameters. The GPT-2 model was trained with 256 Google Cloud TPU v3 cores, which cost $256 per hour.

  • GPT 3

GPT-3 is the third iteration of GPT models developed by OpenAI. GPT-3’s full version has a capacity of 175 billion machine learning parameters. The high quality of the text generated by GPT-3 makes it difficult to determine whether or not a human wrote it.

GPT-3 showed that a language model with adequate training data could solve NLP tasks it has never seen before.

GPT-3 investigates the model as a comprehensive solution for various downstream operations that do not require fine-tuning. Training GPT-3 on a Tesla V100, the fastest GPU on the market, is estimated to take 355 years, and training GPT-3 on the cheapest GPU cloud service would cost US$4.6 million.

The Price of Progress

In 2018, OpenAI found that since 2012, the amount of computational power used to train the state-of-the-art AI models has doubled every 3.4 months. This significant increase in the resources required demonstrates how expensive the field’s achievements have become. The graph below is a logarithmic scale. The difference becomes massive if we view it on a linear scale.

Researchers from MIT have recently published an article that warn deep learning is approaching its computational limits.

They’ve demonstrated that deep learning is computationally expensive by design, not by accident. Its flexibility, which allows it to predict a broad range of phenomenons and beat expert models, also makes it far more computationally expensive. Progress in training models has been dependent on huge increases in the amount of processing power used in various areas of deep learning.

According to researchers, three years of algorithmic development equates to a tenfold gain in computing capacity. A decade earlier, the ‘AI winter’ came to an end, and new benchmarks for computer performance on a wide range of activities have been set thanks to the surge in processing power employed for deep learning models.

Power Limitations

Deep learning’s voracious thirst for processing power has once again placed a limit on how far it can enhance performance in its current form, especially at a time when hardware performance is stagnating. These computing restrictions will likely force machine learning to use less computationally intensive strategies than deep learning. This can be taken as the looming problem of computation capacity for deep learning algorithms.

Tags: Deep Learning Algorithms

Related Posts

AI-Powered Patient Education Materials: Transforming Communication in Healthcare Practices
Tech

AI-Powered Patient Education Materials: Transforming Communication in Healthcare Practices

AI-Powered Patient Education Materials: Transforming Communication in Healthcare Practices Patient communication has traditionally been one of the most critical...

by Daniel Sams
June 2, 2026
Why Biomass Boiler Maintenance North West Services Are Essential for Commercial Heating Systems
Tech

Why Biomass Boiler Maintenance North West Services Are Essential for Commercial Heating Systems

Heating system installation has become an essential aspect of ensuring business efficiency and productivity in all types of commercial...

by admin
May 30, 2026
How Pinterest Became a Powerful Video Platform and Why Users Save Its Content
Tech

How Pinterest Became a Powerful Video Platform and Why Users Save Its Content

Pinterest has evolved from a simple image-sharing platform into one of the most influential visual discovery engines on the...

by Daniel Sams
May 29, 2026
Security Systems in Canberra
Tech

The Importance of Commercial Security Systems in Canberra

As businesses continue to evolve, the need to secure their physical spaces becomes increasingly critical. Investing in robust commercial...

by Daniel Sams
May 23, 2026
Next Post
Charlotte's 2022 Real Estate Forecast

Charlotte's 2022 Real Estate Forecast

Pantheonuk.org


Pantheonuk.org provides a informative articles about the topics of Business, Tech, Lifestyle, Health, Education, News and Travel. It's UK based blogging sites which covers various topics too.

  • Home
  • About
  • Contact

© 2022 pantheonuk.org

No Result
View All Result
  • Home
  • Business
  • Education
  • Fashion
  • Health
  • Lifestyle
  • News
  • Tech
  • Sports
  • Travel

© 2022 pantheonuk