Top new questions this week:
|
By substituting the optimal policy $\pi_{\star}$ into the Bellman equation, we get the Bellman equation for $v_{\pi_{\star}}(s)=v_{\star}(s)$: $$ v_{\star}(s) = \sum\limits_a \pi_{\star}(a|s) \sum\…
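For context, the full Bellman equation for $v_{\star}$ in standard MDP notation (as in Sutton & Barto; $p$, $r$, and $\gamma$ are the transition distribution, reward, and discount factor) expands to:

```latex
v_{\star}(s) = \sum_a \pi_{\star}(a \mid s) \sum_{s', r} p(s', r \mid s, a)\,\bigl[\, r + \gamma\, v_{\star}(s') \,\bigr]
```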
|
From Deep Learning (Goodfellow, Bengio, and Courville), a ReLU activation often “dies” because, as the book puts it, “one drawback to rectified linear units is that they cannot learn via gradient-based methods on …”
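A minimal sketch (not from the book) of why this happens: the ReLU gradient is exactly zero for any negative pre-activation, so a unit stuck in that region receives no gradient signal and cannot recover.

```python
import numpy as np

def relu(x):
    # ReLU clamps negative pre-activations to zero
    return np.maximum(0.0, x)

def relu_grad(x):
    # Derivative is 1 for positive inputs, exactly 0 for negative ones
    return (x > 0).astype(float)

pre_activations = np.array([-2.0, -0.5, 0.3, 1.7])
print(relu(pre_activations))       # negative inputs are clamped to 0
print(relu_grad(pre_activations))  # gradient is 0 wherever input < 0
```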
|
I am training a ResNet model on the CIFAR10 dataset. For the training subset, I selected a random 1% of the train data from the default train/test split. For the test subset, I used the whole default test …
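One way such a 1% split could be drawn (a hypothetical sketch; the asker's actual code is not shown) is to sample indices without replacement from CIFAR10's 50,000-image default train split:

```python
import random

# Hypothetical sketch: CIFAR10's default train split has 50,000 images;
# a random 1% subset is 500 index positions sampled without replacement.
NUM_TRAIN = 50_000
FRACTION = 0.01

random.seed(0)  # fix the seed so the subset is reproducible across runs
subset_indices = random.sample(range(NUM_TRAIN), int(NUM_TRAIN * FRACTION))
print(len(subset_indices))  # → 500
```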
|
The maximum derivative of most currently existing activation functions is around 1. Can an activation function with derivatives higher than 1, say 1000, cause the exploding gradient problem? …
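A back-of-the-envelope sketch of the worst case: backpropagation multiplies one activation-derivative factor per layer, so a derivative bound of $k$ lets gradients scale like $k^{\text{depth}}$.

```python
# Worst-case gradient magnitude if every layer contributes the maximum
# activation derivative (a simplification: weight matrices are ignored).
def worst_case_gradient_scale(max_derivative, depth):
    return max_derivative ** depth

print(worst_case_gradient_scale(1.0, 50))     # → 1.0 (stays bounded)
print(worst_case_gradient_scale(1000.0, 50))  # astronomically large: gradients explode
```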
|
Greatest hits from previous weeks:
|
In semi-supervised learning, there are hard labels and soft labels. Could someone tell me the meaning and definition of the two things?
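A small illustration of the distinction (the class names are invented): a hard label commits to exactly one class, while a soft label is a probability distribution over classes.

```python
# Hard label: one-hot, full confidence in a single class.
# Soft label: a distribution over classes, encoding uncertainty.
classes = ["cat", "dog", "bird"]

hard_label = [0.0, 1.0, 0.0]    # definitely "dog"
soft_label = [0.25, 0.50, 0.25] # probably "dog", with some uncertainty

# Both are valid distributions (non-negative, summing to 1)
assert sum(hard_label) == 1.0 and sum(soft_label) == 1.0
print(classes[hard_label.index(max(hard_label))])  # → dog
```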
|
What is the difference between artificial intelligence and robots?
|
This question is about Reinforcement Learning and variable action spaces for every/some states. Variable action space Let’s say you have an MDP, where the number of actions varies between states (for …
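One common way to handle a state-dependent action set (a hedged sketch, not necessarily what the asker has in mind) is to keep a fixed global action space and mask out invalid actions per state. The state names and action sets below are made up for illustration.

```python
# Fixed global action space shared by all states.
ALL_ACTIONS = ["up", "down", "left", "right"]

# Hypothetical per-state valid-action sets.
VALID_ACTIONS = {
    "corridor": {"up", "down"},                    # only 2 actions here
    "open_room": {"up", "down", "left", "right"},  # all 4 actions here
}

def action_mask(state):
    # True where the global action is valid in this state; an agent's
    # policy can zero out (mask) the invalid entries before sampling.
    return [a in VALID_ACTIONS[state] for a in ALL_ACTIONS]

print(action_mask("corridor"))  # → [True, True, False, False]
```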
|
In hill climbing methods, at each step, the current solution is replaced with the best neighbour (that is, the neighbour with the highest/lowest value). In simulated annealing, “downhill” moves are …
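The acceptance rule that distinguishes the two can be sketched as the Metropolis criterion (for minimization): improving moves are always taken, while worsening moves are accepted with probability $e^{-\Delta / T}$, which shrinks as the temperature $T$ cools.

```python
import math
import random

def accept(current_cost, candidate_cost, temperature, rng=random.random):
    # Metropolis acceptance rule for simulated annealing (minimization).
    delta = candidate_cost - current_cost
    if delta <= 0:
        return True  # improving move: always accept, as in hill climbing
    # Worsening move: accept with probability exp(-delta / T);
    # at high T this is near 1, and as T -> 0 it approaches pure hill climbing.
    return rng() < math.exp(-delta / temperature)

print(accept(10.0, 8.0, temperature=1.0))  # → True (improvement)
```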
|
As far as I can tell, BERT is a type of Transformer architecture. What I do not understand is: how is BERT different from the original Transformer architecture? What tasks are better suited for BERT,…
|
The following paragraph is from page 331 of the textbook Natural Language Processing by Jacob Eisenstein. It mentions a certain type of task called downstream tasks, but it provides no …
|
What are the differences between the A* algorithm and the greedy best-first search algorithm? Which one should I use? Which algorithm is the better one, and why?
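The core difference can be sketched in code: both are best-first searches that differ only in the node priority. Greedy best-first expands by $h(n)$ alone, while A* expands by $f(n) = g(n) + h(n)$, which with an admissible heuristic finds an optimal path. The toy graph and heuristic below are invented for illustration.

```python
import heapq

GRAPH = {  # node -> [(neighbor, edge_cost)]; a made-up example graph
    "S": [("A", 1), ("B", 4)],
    "A": [("G", 10)],
    "B": [("G", 1)],
    "G": [],
}
H = {"S": 3, "A": 1, "B": 2, "G": 0}  # admissible heuristic estimates to G

def best_first(start, goal, use_g):
    # use_g=False: greedy best-first (priority = h only)
    # use_g=True:  A* (priority = g + h)
    frontier = [(H[start], 0, start, [start])]  # (priority, g, node, path)
    visited = set()
    while frontier:
        _, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, g
        if node in visited:
            continue
        visited.add(node)
        for nxt, cost in GRAPH[node]:
            g2 = g + cost
            prio = (g2 if use_g else 0) + H[nxt]
            heapq.heappush(frontier, (prio, g2, nxt, path + [nxt]))
    return None, float("inf")

print(best_first("S", "G", use_g=False))  # greedy chases low h through A (cost 11)
print(best_first("S", "G", use_g=True))   # A* finds the cheaper S-B-G path (cost 5)
```

Here greedy best-first is lured toward A by its low heuristic value and returns a cost-11 path, while A* accounts for the path cost already paid and returns the optimal cost-5 route, which is why A* is usually preferred when solution quality matters.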
|