Top new questions this week:
|
Could you explain if this plot is good or bad. It is a sentiment analysis modelusing LSTM layers.20% of the data used for validation, and the remaining 80% for training.
|
During prediction phase, fully trained supervised models may have to deal with data representing new classes, that weren’t part of the training and test sets. A real world example for this issue is …
|
What is the best way to summarize a long text that exceeds 4096 token limit (like a podcast transcript for example)? As I understand I need to split the text into chunks to summarize, and then …
|
I have created a neural network for sentiment analysis using bidirectional LSTM layers and pre-trained GloVe embeddings. During the training I noticed that the …
|
I have been looking at the scikit learn Canonical Correlation Analysis (CCA) algorithm, and I have come across the terms “weights” and “rotations” as parameters of the CCA model. …
|
For example, I got 5 variables, A to E, for prediction of a value. A B C = A – B D E the output of random forest rank C, B and A are variables with the most importance in descending order, my question …
|
I am working on an analysis using a dictionary-based text-as-data approach. I have a dataset of texts (n=1200), and I am applying a dictionary of 50 words (I tokenize the text with each word being one …
|
Greatest hits from previous weeks:
|
I know that there is a possibility in Keras with the class_weights parameter dictionary at fitting, but I couldn’t find any example. Would somebody so kind to …
|
It seems to me that the $V$ function can be easily expressed by the $Q$ function and thus the $V$ function seems to be superfluous to me. However, I’m new to reinforcement learning so I guess I got …
|
Suppose I have a 5*3 data frame in which third column contains missing value 1 2 3 4 5 NaN 7 8 9 3 2 NaN 5 6 NaN I hope to generate value for missing value based …
|
I am new to pytorch and started with this github code. I do not understand the comment in line 60-61 in the code …
|
Can someone practically explain the rationale behind Gini impurity vs Information gain (based on Entropy)? Which metric is better to use in different scenarios while using decision trees?
|
…
|
I am trying to run SVR using scikit-learn (python) on a training dataset that has 595605 rows and 5 columns (features) while the test dataset has 397070 rows. The data has been pre-processed and …
|
Can you answer these questions?
|
I have a doubt. I am currently using an integrated gradient for the DNN model for explainability. In that, we can specify the baseline as a parameter to the function. I am using all zeros for this. I …
|
Posterior collapse means the variational distribution collapse towards the prior: $\exists i: s.t. \forall x: q_{\phi}(z_i|x) \approx p(z_i)$. $z$ becomes independent of $x$. We would like to avoid it …
vae |
|
|
asked by JXuan Score of 1 |
|
answered by Dan Score of 0 |
|
|
|
I am using the RStudio Server Amazon Machine Image (AMI) for a collaborative statistical learning research project intended to produce a paper for publication because its computational requirements …
|