Top new questions this week:
|
When ChatGPT is generating an answer to my question, it generates it word by word. So I actually have to wait until I get the final answer. Is this just for show? Or is it really real-time generating …
|
Why does A2C use the actual returns from an episode in calculating the advantage instead of using a bellman equation style estimate of the value? Basically, why this: $A(s,a) = \sum_t\gamma^tr_t – V(s)…
|
Is it difficult for other companies to train a model similar to ChatGPT, and what makes it difficult? What is challenging about reproducing the results obtained by OpenAI with ChatGPT/GPT3.5? Would it …
|
I am stuck at the proof of the contraction of variance for distributional Bellman operator from the paper, in which it is defined as and the proof is stated as In its second part, how is the …
|
Soft question here. I was recently learning a bit about how it is feasible to train a transformer on a personal computer like an M1 Mac. I have been told that the model could have 1-3 million …
|
ChatGPT has had a lot of buzz around it recently, and for good reason. It has shown some amazing capabilities in responding to new information, as well as in generalizing new information that has been …
|
I note this question was deemed off-topic, so I’m trying to clearly frame this question in terms of scope of response I’m interested in, namely ethics and sustainability issues associated with the …
|
Greatest hits from previous weeks:
|
I’m trying to create and test non-linear SVMs with various kernels (RBF, Sigmoid, Polynomial) in scikit-learn, to create a model which can classify anomalies and benign behaviors. My dataset includes …
|
In reinforcement learning (RL), the credit assignment problem (CAP) seems to be an important problem. What is the CAP? Why is it relevant to RL?
|
I’m facing the problem of having images of different dimensions as inputs in a segmentation task. Note that the images do not even have the same aspect ratio. One common approach that I found in …
|
I came across this answer on Quora, but it was pretty sparse. I’m looking for specific meanings in the context of machine learning, but also mathematical and economic notions of the term in general.
|
In reinforcement learning, there are the concepts of stochastic (or probabilistic) and deterministic policies. What is the difference between them?
|
I’m a bit confused about the definition of life. Can AI systems be called ‘living’? Because they can do most of the things that we can. They can even communicate with one another. They are not …
|
If the original purpose for developing AI was to help humans in some tasks and that purpose still holds, why should we care about its explainability? For example, in deep learning, as long as the …
|
Can you answer these questions?
|
The game Elite Dangerous has a proceduraly generated galaxy of some 400 billion star systems. Each star system in the game can be uniquely identified bu a 64bit number (id64) which is used as a seed …
|
How does the memory augmented neural network (MANN) work? How can I make a simple MANN with a vanilla neural network especially without a recurrent network?
|