Table of Contents

Artificial Intelligence Stack Exchange Community Digest

Top new questions this week:

Sutton & Barto: what are parametrized functions?

From “Reinforcement Learning: An introduction (2nd ed.)” by Richard S. Sutton and Andrew G. Barto, on page 59 Instead, the agent would have to maintain $v_\pi$ and $q_\pi$ as parameterized …

reinforcement-learning terminology sutton-barto

asked by SomeoneUnknown Score of 3

answered by nbro Score of 6

How was ChatGPT trained?

I know that large language models like GPT-3 are trained simply to continue pieces of text that have been scraped from the web. But how was ChatGPT trained, which, while also having a good …

chat-bots training-datasets chatgpt

asked by HelloGoodbye Score of 2

answered by Rexcirus Score of 1

What size language model can you train on a GPU with x GB of memory?

I’m trying to figure out what size language model I will be able to train on a GPU with a certain amount of memory. Let’s for simplicity say that 1 GB = 109 bytes; that means that, for example, on a …

natural-language-processing language-model gpu memory

asked by HelloGoodbye Score of 2

Explanation of Cross-Modality Linear Transformer

So I am trying to understand how a Cross-Modality Linear Transformer is different from an a basic transformer. I found the transformer mentioned in this paper. Am I correct in understanding that, the …

transformer

asked by Nikita Belooussov Score of 1

answered by Mariusmarten Score of 1

Is VAE the same as the E-step of the EM algorithm?

EM(Expectation Maximum) Target: maximize $p_\theta(x)$ $ p_\theta(x)=\frac{p_\theta(x, z)}{p_\theta(z \mid x)} \\\\$ Take log on both sides: $ \log p_\theta(x)=\log p_\theta(x, z)-\log p_\theta(z \…

machine-learning variational-autoencoder evidence-lower-bound maximum-likelihood expectation-maximization

asked by Garfield Score of 1

Machine Learning for raw measurement data

i have raw measurement data of different events. My first approach was to calculate features of those events, do scaling, PCA and feature selection and then feed those features to different machine …

machine-learning

asked by toben aus Score of 1

Greatest hits from previous weeks:

What are the differences between A* and greedy best-first search?

What are the differences between the A* algorithm and the greedy best-first search algorithm? Which one should I use? Which algorithm is the better one, and why?

algorithm search comparison a-star

asked by Marosh Fatima Score of 11

answered by nbro Score of 11

How is iterative deepening A* better than A*?

The iterative deepening A* search is an algorithm that can find the shortest path between a designated start node and any member of a set of goals. The A* algorithm evaluates nodes by combining the …

search comparison a-star ida-star

asked by Huma Qaseem Score of 6

answered by nbro Score of 11

Can BERT be used for sentence generating tasks?

I am a new learner in NLP. I am interested in the sentence generating task. As far as I am concerned, one state-of-the-art method is the CharRNN, which uses RNN to generate a sequence of words. …

neural-networks deep-learning natural-language-processing bert text-generation

asked by ch271828n Score of 25

answered by soloice Score of 32

Why is Sanskrit the best language for AI?

According to NASA scientist Rick Briggs, Sanskrit is the best language for AI. I want to know how Sanskrit is useful. What’s the problem with other languages? Are they really using Sanskrit in AI …

comparison programming-languages nasa

asked by Rahul Score of 16

answered by Christian Westbrook Score of 16

What is the difference between tree search and graph search?

I have read various answers to this question at different places, but I am still missing something. What I have understood is that a graph search holds a closed list, with all expanded nodes, so …

comparison definitions search graph-search tree-search

asked by xava Score of 19

answered by Amrinder Arora Score of 18

Could a neural network detect primes?

I am not looking for an efficient way to find primes (which of course is a solved problem). This is more of a “what if” question. So, in theory, could you train a neural network to predict …

neural-networks prediction primality-test