Questions tagged [gpt-3]

Generative Pre-trained Transformer 3 (GPT-3) is an autoregressive large language model that uses deep learning to produce human-like text. It was introduced by OpenAI in May 2020 and has 175 billion parameters.

Read more on Wikipedia

32 questions
27 votes, 1 answer

What is the "temperature" in the GPT models?

What does the temperature parameter mean when talking about the GPT models? I know that a higher temperature value means more randomness, but I want to know how randomness is introduced. Does temperature mean we add noise to the weights/activations…
Tom Dörr
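For the temperature question above, a minimal sketch of how temperature is typically applied at sampling time may help: the logits are divided by the temperature before the softmax, so the randomness comes from sampling the resulting distribution, not from noise added to weights or activations. This is an illustrative NumPy sketch, not OpenAI's implementation.

    import numpy as np

    def sample_with_temperature(logits, temperature=1.0, rng=np.random.default_rng()):
        """Turn next-token logits into probabilities and sample one token id.

        Temperature rescales the logits before the softmax: T < 1 sharpens the
        distribution (more deterministic), T > 1 flattens it (more random).
        No noise is added to the model's weights or activations.
        """
        scaled = logits / temperature
        probs = np.exp(scaled - scaled.max())   # subtract max for numerical stability
        probs /= probs.sum()
        return rng.choice(len(probs), p=probs)

    # Toy example with three candidate tokens
    logits = np.array([2.0, 1.0, 0.1])
    print(sample_with_temperature(logits, temperature=0.2))  # almost always token 0
    print(sample_with_temperature(logits, temperature=2.0))  # far more varied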
15 votes, 1 answer

What language is the GPT-3 engine written in?

I know that the API is Python-based, but what is the GPT-3 engine mostly written in? C? C++? I'm having some trouble finding this info.
8 votes, 2 answers

Is GPT-4 based on GPT-3, or was it trained from scratch?

To me it looks like GPT-4 is based on GPT-3. On the other hand, there were rumors that the training of GPT-3 was done with errors, but retraining was impossible due to the cost.
Anixx
8 votes, 2 answers

Are GPT-3.5 series models based on GPT-3?

In the official blog post about ChatGPT from OpenAI, there is this paragraph explaining how the ChatGPT model was trained: We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with…
iMad
8 votes, 1 answer

What causes ChatGPT to generate responses that refer to itself as a bot or LM?

ChatGPT occasionally generates responses to prompts that refer to itself as a "bot" or "language model." For instance, when given a certain input (the first paragraph of this question) ChatGPT produces (in part) the output: It is not appropriate…
4 votes, 1 answer

What's the difference between GPT-3.5 and InstructGPT?

I read about the different GPT-3.5 model series here: https://platform.openai.com/docs/models/gpt-3-5. At the beginning of the page, it suggests looking at https://platform.openai.com/docs/model-index-for-researchers to understand the difference…
Arya
4 votes, 0 answers

How is ChatGPT maintaining context?

It has been suggested in the answer to this earlier question that it is just remembering a certain amount of recent information. The reference used is this post by OpenAI which says that ChatGPT should only be able to maintain a context of around…
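As a rough illustration of the "remembering recent information" explanation referenced above, here is a minimal sketch of how a chat front end can appear to maintain context: it simply resends a truncated window of the conversation with every request. The token budget and the whitespace token counter are assumptions for illustration, not OpenAI's actual limit or tokenizer.

    MAX_CONTEXT_TOKENS = 4096  # assumed budget, for illustration only

    def count_tokens(text):
        # crude stand-in for a real tokenizer such as tiktoken
        return len(text.split())

    def build_context(history, user_message):
        """Keep only the most recent turns that fit in the assumed token budget."""
        turns = history + [("user", user_message)]
        kept, used = [], 0
        for role, text in reversed(turns):
            cost = count_tokens(text)
            if used + cost > MAX_CONTEXT_TOKENS:
                break  # older turns are silently dropped, i.e. "forgotten"
            kept.append((role, text))
            used += cost
        return list(reversed(kept))  # oldest-first, ready to resend with the next request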
4 votes, 1 answer

How to get GPT-3 to translate a specific word in a sentence?

I just gave GPT-3 the following prompt (in the playground, using text-davinci-001 with default settings): What's the German word for "can" in the sentence "The man removes the can."? The word "can" in this sentence is obviously a noun and not a…
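One hedged approach to the word-in-context question above is to spell out the disambiguation with a few-shot prompt. The wording and examples below are illustrative only, not a verified recipe for text-davinci-001:

    # Illustrative few-shot prompt; the example translations are for demonstration.
    prompt = (
        'Translate the quoted word as it is used in the sentence.\n\n'
        'Sentence: "She can swim fast." Word: "can" -> German: "kann" (verb)\n'
        'Sentence: "He opened a can of soup." Word: "can" -> German: "Dose" (noun)\n'
        'Sentence: "The man removes the can." Word: "can" -> German:'
    )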
4 votes, 3 answers

How can GPT-3 be used for designing electronic circuits from text descriptions?

I was wondering if it is possible to use GPT-3 to translate a text description of a circuit into any circuit design language, which in turn can be used to make the circuit. If it is possible, what approach would you suggest?
2 votes, 1 answer

If GPT-3 is trained on predicting the next token, how is it able to take commands?

From my understanding, GPT-3 is trained on predicting the next token from a sequence of tokens. Given this, how is it able to take commands? For instance, in this example input, wouldn't the statistically most likely prediction be to insert a period…
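For the question above, a minimal sketch may make the point concrete: "taking a command" is still ordinary next-token prediction, because the instruction is just part of the conditioning text. GPT-3's weights are not publicly available, so GPT-2 via the Hugging Face transformers library stands in here:

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    # The "command" is only a prefix; the model keeps predicting likely next tokens.
    prompt = "Translate English to French:\nsea otter -> loutre de mer\ncheese ->"
    inputs = tokenizer(prompt, return_tensors="pt")
    out = model.generate(**inputs, max_new_tokens=5, do_sample=False,
                         pad_token_id=tokenizer.eos_token_id)
    print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:]))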
2 votes, 2 answers

How do transformers understand data and answer custom questions?

I recently heard of GPT-3 and I don't understand how attention models and transformer encoders and decoders work. I heard that GPT-3 can make a website from a description and write perfectly factual essays. How can it understand our world using…
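A compact way to see what "attention" does, relevant to the question above, is a single-head scaled dot-product self-attention sketch in NumPy: each token's output is a weighted mix of every token's value vector, with the weights derived from query/key similarity. This is a didactic sketch, not GPT-3's actual multi-head, multi-layer implementation.

    import numpy as np

    def self_attention(X, Wq, Wk, Wv):
        """Single-head scaled dot-product self-attention over a token sequence X."""
        Q, K, V = X @ Wq, X @ Wk, X @ Wv
        scores = Q @ K.T / np.sqrt(K.shape[-1])
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the sequence
        return weights @ V  # each row mixes information from all tokens

    rng = np.random.default_rng(0)
    X = rng.normal(size=(4, 8))                        # 4 toy tokens, width 8
    Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
    print(self_attention(X, Wq, Wk, Wv).shape)         # (4, 8)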
1 vote, 1 answer

Repainting a picture in the style of some painter (or of another picture)

It sounds like a straightforward task for DALL-E (and GPT?) to present a painting and ask to repaint it "in the style of Leonardo da Vinci", just as one can present a text and ask to rewrite it in the style of some author. Or even better: to present…
Hans-Peter Stricker
1 vote, 0 answers

How much do we know about the architectures of the Codex (prototype) models?

The transformer model Codex by OpenAI was introduced in a 2021 paper. The paper does not give complete information about the architecture. Below I've quoted all the passages in the paper that give hints as to the architecture: ...we hypothesized…
Jack M
1 vote, 0 answers

Computation required for a GPT model to choose the likely word from n options where n < total vocabulary size

Let's imagine two different use cases for an LLM/GPT-3: (1) predicting the next most likely word in a sequence using all ~50k words in its dictionary (i.e. the standard method of prompting an LLM); (2) checking whether "Word-1" is more likely than "Word-2" to…
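For the second use case above (checking whether "Word-1" is more likely than "Word-2"), a single forward pass over the prompt is enough: compare the two candidates' scores at the last position instead of ranking the whole vocabulary. A hedged sketch using GPT-2 through Hugging Face transformers (GPT-3 itself is only reachable through the API, e.g. via its logprobs option):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    def more_likely(prompt, word_a, word_b):
        """Return whichever candidate word the model scores higher as the next token.
        Assumes each candidate maps to a single token (the leading space matters for BPE)."""
        ids = tok(prompt, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(input_ids=ids).logits[0, -1]   # scores for the next position only
        log_probs = torch.log_softmax(logits, dim=-1)
        a_id = tok(" " + word_a, add_special_tokens=False).input_ids[0]
        b_id = tok(" " + word_b, add_special_tokens=False).input_ids[0]
        return word_a if log_probs[a_id] > log_probs[b_id] else word_b

    print(more_likely("The cat sat on the", "mat", "car"))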
1 vote, 1 answer

Fine-tune GPT-Neo with prompt and completion?

I'm new to AI and machine learning. To fine-tune GPT-3, I understand that we need a set of training examples that each consist of a single input ("prompt") and its associated output ("completion"). I have prepared a dataset with "prompt" and…
SoftTimur
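For the fine-tuning question above, a hedged sketch of the prompt/completion dataset format and of one common way to feed it to a plain causal LM such as GPT-Neo (concatenating prompt and completion into a single training string) may help; the examples and the end-of-text marker are assumptions for illustration:

    import json

    # Illustrative examples in the prompt/completion JSONL style used for fine-tuning.
    examples = [
        {"prompt": "Translate to French: cat ->", "completion": " chat"},
        {"prompt": "Translate to French: dog ->", "completion": " chien"},
    ]

    with open("train.jsonl", "w") as f:
        for ex in examples:
            f.write(json.dumps(ex) + "\n")

    # GPT-Neo is an ordinary causal LM, so one common approach is to concatenate
    # prompt and completion and fine-tune on next-token prediction of the result.
    texts = [ex["prompt"] + ex["completion"] + "<|endoftext|>" for ex in examples]
    print(texts[0])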