Top new questions this week:
|
I’m reading about how conditional probability/Bayes’ theorem is used in Naive Bayes in Intro to Statistical Learning, but it seems like it isn’t as “groundbreaking” as it is described? If …
|
One of the main concerns about using ChatGPT answers on Stack Exchange is that ChatGPT may copy, verbatim or almost verbatim, some text from its training set, which may infringe the source text’s license. This …
|
BACKGROUND: There is a lot of information online about the problem of multicollinearity as it relates to machine learning and how to identify correlated features. However, I am still unclear on which …
|
I mainly tackle regression problems with CNNs and want to find a reliable method to compute heatmaps for the network’s results. However, I find that almost all interpretation methods, including CAM, are used for …
|
On the Keras website, there is example code for a variational autoencoder. At the end of that page, there is example code that plots the latent space learned from MNIST. The code is as follows: …
|
Waiting for an entire episode to finish before doing backprop can build up a very large computational graph, which is a burden on memory. Could one still learn a good policy by doing backprop every fixed number …
|
Truthfulness is a quality that Sam Altman mentioned could be improved in ChatGPT. This question considers a small example to try to understand the problem. A large language model includes a large set …
|
Greatest hits from previous weeks:
|
I am working on a problem where I need to determine whether two sentences are similar or not. I implemented a solution using the BM25 algorithm and WordNet synsets for determining syntactic & …
|
I want to create an AI which can play five-in-a-row/Gomoku. I want to use reinforcement learning for this. I use the policy gradient method, namely REINFORCE, with baseline. For the value and policy …
|
I’ve seen these terms thrown around this site a lot, specifically in the tags convolutional-neural-networks and neural-networks. I know that a neural network is a system based loosely on the human …
|
I’m struggling to understand the GAN loss function as provided in Understanding Generative Adversarial Networks (a blog post written by Daniel Seita). In the standard cross-entropy loss, we have an …
|
I think that the advantage of using Leaky ReLU instead of ReLU is that this way we cannot have vanishing gradients. Parametric ReLU has the same advantage, with the only difference being that the slope of …
|
How do I show that uniform-cost search is a special case of A*? How do I prove this?
|
These types of questions may be problem-dependent, but I have tried to find research that addresses the question of whether the number of hidden layers and their size (number of neurons in each layer) …
|