Top new questions this week:
|
I have read several resources, including previously asked questions such as this. I have also read arguments related to intercepts needed to separate linearly separable data. If my neural network can …
|
I am reading this paper about Group Equivariant Convolutional Networks. Basically, it is a CNN whose construction makes the network naturally equivariant to Group transformations (e.g. rotations) of …
|
Can LSTM model use ReLU or LeakyReLU as the activation funtion? If so, when should one use tanh and when should one use ReLU or LeakyReLU?
|
It is a sentence that I hear a lot and I guess I don’t get what it means. It seems that the weight optimization procedure is very well understood and there is, to some extent, theoretical/empirical …
|
According to OpenAI, ChatGPT is trained in a 3-step process. Are the steps where human AI trainers are involved, i.e. training the initial policy and providing the A>B>C>D grading as …
|
Before I describe my challenge, I want to point out that I have searched extensively online for “outlier image detection”, “anomaly images detection”, etc., but all returned …
|
I have a dataset which has users (rows) with the list of their interests (IABs), which looks like this …
|
Greatest hits from previous weeks:
|
I have been messing around in tensorflow playground. One of the input data sets is a spiral. No matter what input parameters I choose, no matter how wide and deep the neural network I make, I cannot …
|
Suppose that I have 10K images of sizes $2400 \times 2400$ to train a CNN. How do I handle such large image sizes without downsampling? Here are a few more specific questions. Are there any …
|
If the original purpose for developing AI was to help humans in some tasks and that purpose still holds, why should we care about its explainability? For example, in deep learning, as long as the …
|
I am training LSTM neural networks with Keras on a small mobile GPU. The speed on the GPU is slower than on the CPU. I found some articles that say that it is hard to train LSTMs (and, in general, …
|
I’ve heard the terms strong-AI and weak-AI used. Are these well defined terms or subjective ones? How are they generally defined?
|
We often hear that artificial intelligence may harm or even kill humans, so it might prove dangerous. How could artificial intelligence harm us?
|
My understanding is that the convolutional layer of a convolutional neural network has four dimensions: …
|
Can you answer these questions?
|
pass@k is a metric used to evaluate models that generate code, used for example to evaluate Codex. To evaluate pass@k, you have a dataset of natural language/code pairs, and you pass each NL prompt to …
|
I just started learning about AI and have been reading a book called “Foundations of Machine Learning” by Mehryar Mohri so that I can try to create my own. I had a question come up recently: …
|
Prioritized exeperience replay (PER) biases the sampling and introduces importance sampling (IS) correction to the Q-function update. Weights are $w = \frac{1}{N P}^\beta$, where $N$ is the batch size …
|