Questions tagged [pretrained-models]

For questions related to pre-trained models. A pre-trained model is a model that was trained on a large benchmark dataset to solve a problem similar to the one we want to solve. Because of the computational cost of training such models from scratch, it is common practice to import and use models from the published literature (e.g. VGG, Inception, MobileNet).

34 questions
4 votes · 1 answer

What is the difference between fine-tuning and variants of few-shot learning?

I am trying to understand the concepts of fine-tuning and few-shot learning. I understand the need for fine-tuning: it is essentially tuning a pre-trained model to a specific downstream task. However, recently I have seen a plethora of blog posts…
4 votes · 0 answers

How can I improve the performance of a model trained to detect vehicle poses?

I'm looking for some suggestions on how to improve our vehicle image recognition. We have an online marketplace where customers submit photos of their vehicles. The photos need to meet certain requirements before the advert can be approved.…
3 votes · 1 answer

How does a software license apply to pretrained models?

Google provides a lot of pretrained TensorFlow models, but I cannot find a license. I am interested in the tfjs-models. The code is licensed under Apache-2.0, but the models are downloaded by the code, so the license of the repository probably does not…
allo
3 votes · 1 answer

Are there any better visual models for transfer learning than ImageNet?

Similar to the recent push toward pretrained language models (BERT, GPT2, XLNet), I was wondering if such a thrust exists in computer vision? From my understanding, it seems the community has converged and settled on ImageNet-trained classifiers as…
mshlis
2 votes · 1 answer

Should I use a pretrained model for image classification or not?

I have thousands of images similar to this. I can classify them using existing metadata into different folders according to the gravel product type loaded on the truck. What would be the optimal way to train a model for image classification that would be…
2 votes · 1 answer

Does BERT freeze the entire model body when it does fine-tuning?

Recently, I came across the BERT model. I did some research and tried some implementations. I wanted to tackle a NER task, so I chose BertForSequenceClassification provided by Hugging Face. for epoch in range(1, args.epochs + 1): total_loss…
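For context on what "freezing the model body" means mechanically, here is a minimal numpy sketch. The two weight matrices are illustrative stand-ins for a BERT encoder and a task head, not the real model; by default, Hugging Face fine-tuning updates all parameters unless you explicitly freeze some.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in "encoder": a fixed projection, playing the role of a frozen BERT body.
W_enc = rng.normal(size=(10, 4))   # frozen: never updated below
W_head = rng.normal(size=(4, 2))   # task head: the only trainable part

x = rng.normal(size=(10,))
h = x @ W_enc                      # encoder output (frozen features)
logits = h @ W_head

# One SGD step on the head only; the encoder weights are left untouched.
grad_head = np.outer(h, np.array([1.0, -1.0]))  # pretend gradient of some loss
W_enc_before = W_enc.copy()
W_head -= 0.1 * grad_head

assert np.array_equal(W_enc, W_enc_before)      # body unchanged: "frozen"
```

In PyTorch terms, freezing corresponds to setting `requires_grad = False` on the body's parameters so the optimizer only updates the head.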
2 votes · 1 answer

What are some most promising ways to approximate common sense and background knowledge?

I learned from this blog post Self-Supervised Learning: The Dark Matter of Intelligence that We believe that self-supervised learning (SSL) is one of the most promising ways to build such background knowledge and approximate a form of common sense…
2 votes · 1 answer

Which hyperparameters in a neural network are accessible to user adjustment?

I am new to neural networks and my questions are still very basic. I know that most neural networks allow and even ask the user to choose hyper-parameters such as: the number of hidden layers, the number of neurons in each layer, the number of inputs and…
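As a rough illustration, the hyper-parameters the excerpt lists usually surface as a training configuration like the one below. All names and values are illustrative, not tied to any particular library:

```python
# Hypothetical training configuration; every entry is user-adjustable.
config = {
    "hidden_layers": 2,        # number of hidden layers
    "units_per_layer": 128,    # number of neurons in each layer
    "input_size": 784,         # number of inputs (e.g. a 28x28 image)
    "learning_rate": 1e-3,     # optimizer step size
    "batch_size": 32,          # examples per gradient step
    "epochs": 10,              # passes over the training set
}
print(sorted(config))
```

Architecture-defining entries (layers, units, inputs) are fixed before training, while optimizer settings (learning rate, batch size, epochs) can also be tuned across runs.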
2 votes · 1 answer

How to use pre-trained BERT to extract the vectors from sentences?

I'm trying to extract vectors from sentences. I spent so much time searching for pre-trained BERT models but found nothing. Is it possible to get the vectors using pre-trained BERT from the data?
Pluviophile
1 vote · 0 answers

What is the difference between prompt tuning and prefix tuning?

I read that prompt tuning and prefix tuning are two effective mechanisms for leveraging frozen language models to perform downstream tasks. What is the difference between the two, and how do they really work? Prompt Tuning:…
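A shape-level sketch of the distinction, using zero-initialized numpy arrays as stand-ins for the learned parameters: prompt tuning prepends trained "soft" embeddings once at the input layer, while prefix tuning injects learned vectors into every layer's attention; in both cases the language model itself stays frozen.

```python
import numpy as np

seq_len, hidden, n_virtual, n_layers = 6, 16, 4, 3
token_emb = np.random.randn(seq_len, hidden)  # embeddings of the real input tokens

# Prompt tuning: one block of trainable soft-prompt embeddings is prepended
# to the input sequence; the frozen model simply sees a longer input.
soft_prompt = np.zeros((n_virtual, hidden))        # the only trained weights
prompted = np.concatenate([soft_prompt, token_emb])
print(prompted.shape)  # (10, 16)

# Prefix tuning: trainable prefix vectors exist per layer and are attended to
# as extra key/value positions inside every attention block, not just layer 0.
prefixes = [np.zeros((n_virtual, hidden)) for _ in range(n_layers)]
print(len(prefixes))   # 3
```

So prefix tuning has more trainable parameters (one prefix per layer) and steers the model's internal attention directly, whereas prompt tuning only modifies the input sequence.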
1 vote · 1 answer

Using a pre-trained model to generate labels for data to then train a model on

I'm trying to set up a pipeline for my ML models to automatically re-train themselves whenever concept drift occurs to recalibrate to the new output distributions. However, I can't get ground-truth from my data without manual labeling, and I want an…
1 vote · 1 answer

How to Train a Decoder for a Pre-trained BERT Transformer-Encoder?

Context: I am currently working on an encoder-decoder sequence to sequence model that uses a sequence of word embeddings as input and output, and then reduces the dimensionality of the word embeddings. The word embeddings are created using…
1 vote · 0 answers

How to scrape product data on supplier websites?

I'm currently trying to build a semantic scraper that can extract product information from different company websites of suppliers in the packaging industry (with as little manual customization per supplier/website as possible). The current approach…
1 vote · 0 answers

Is it good practice to save NLP Transformer-based pre-trained models to the file system in a production environment?

I have developed a multi-label classifier using BERT. I'm leveraging the Hugging Face PyTorch implementation for transformers. I have saved the pretrained model into a file directory in the dev environment. Now, the application is ready to be moved to the…
1 vote · 0 answers

How to measure/estimate the energy consumption of CNN models during testing?

Does someone know a method to estimate or measure the total energy consumption during the test phase of the well-known CNN models? So with a tool or a power meter… MIT already has a tool to estimate energy consumption, but it only works on…