Top new questions this week:
|
Many later RL algorithms like PPO or Dueling DQN estimate the advantage. I am not sure how that really helps. For instance, the actor loss for a simple actor-critic algorithm is given by – <...
|
When training a Generative Adversarial Network (GAN), a perfect discriminator (D) is one that outputs 1 (“true image”) for all images in the training dataset and 0 (“false …
|
As a student of numerical analysis, I can see how mathematical analysis is involved in building a language program, specifically in the convergence analysis of an approximation method. But, while chatting …
|
Let’s imagine two different use cases for an LLM/GPT-3: (1) predicting the next most likely word in a sequence using all ~50k words in its dictionary (i.e. the standard method of prompting an LLM); (2) checking …
|
I’m working my way through how ChatGPT works. So I read that ChatGPT is a generative model. When searching for generative models, I found two definitions: A generative model includes the distribution …
|
I have seen this encoding of an image as a graph: The set of nodes $V$ is the set of pixels. If the image is of size $10\times10$, then we have $10\cdot10=100$ pixels. Each node has a length 3 …
|
I am working on a classification algorithm for brain rhythms. However, when I implemented the metrics for precision, accuracy, F1 score and recall, my results show that my algorithm has a high …
|
Greatest hits from previous weeks:
|
The skip-gram and continuous bag of words (CBOW) are two different types of word2vec models. What are the main differences between them? What are the pros and cons of both methods?
|
I’m working on a project where we use an encoder-decoder architecture. We decided to use an LSTM for both the encoder and decoder due to its hidden states. In my specific case, the hidden state of …
|
I’ve got an array of integers ranging from -3 to +3. Example: [1, 3, -2, 0, 0, 1]. The array has no obvious pattern since it represents bipolar disorder mood swings. What is the most suitable approach …
|
In reinforcement learning (RL), the credit assignment problem (CAP) seems to be an important problem. What is the CAP? Why is it relevant to RL?
|
What are the limitations of the hill climbing algorithm? How can we overcome these limitations?
|
I’m aware that neural networks are probably not designed for this, but, hypothetically, is it possible to train a deep neural network (or similar) to solve math equations? So given the …
|
I think that the advantage of using Leaky ReLU instead of ReLU is that in this way we cannot have vanishing gradients. Parametric ReLU has the same advantage, the only difference being that the slope of …
|
Can you answer this question?
|
I am trying to refine my knowledge of AI, but unsupervised learning is a bit of a stumbling block for me. I understand that it finds ‘hidden’ patterns in data, but if they are hidden, how does a user …
|