Top new questions this week:
|
It is not clear to me what advantage the EDA data visualization provides. By advantage I mean what decision I will make according to one or the other visualization. Could someone give me an example …
|
I have an imbalanced dataset and I want to train a binary classifier to model the dataset. Here was my approach which resulted into (relatively) acceptable performance: 1- I made a random split to get …
|
Currently, I have a dataset with pairs . The idea is to detect any anomaly in these relationships. I was able to just use pandas to do the analysis so far. |Customer|Agent|Duration|Marks|etc |C1…….
|
I am currently doing my master’s thesis and at the end of finishing it, but there are some questions raised by my supervisor. I have answered most of the questions, but only one question is remaining …
|
I am starting with deep learning and decided to code a backpropagation algorithm on Python 3. I have followed many tutorials and have taken as example many programs that work. Yet, for some reason, my …
|
I have 2k time series and want to optimize the hyperparameters of my prophet model. It takes 1 hour to train and evaluate on every time series for each hyperparam combination. So, I want to run it on …
|
I am working in a ML project where I would like to predict the final result of a soccer game. There are three output categories: Local Win, Tie, Visitor Win. In order to do so, I am using stacking …
|
Greatest hits from previous weeks:
|
I am trying to perform k-means clustering on multiple columns. My data set is composed of 4 numerical columns and 1 categorical column. I already researched previous questions but the answers are not …
|
I got this matrix …
|
When I was reading about using StandardScaler, most of the recommendations were saying that you should use StandardScaler before …
|
I’ve been using SQL since 1996, so I may be biased. I’ve used MySQL and SQLite 3 extensively, but have also used Microsoft SQL Server and Oracle. The vast majority of the operations I’ve seen done …
|
After I developed my predictive model using Random Forest I get the following metrics: …
|
I got ValueError when predicting test data using a RandomForest model. My code: …
|
I am trying to understand the key differences between GBM and XGBOOST. I tried to google it, but could not find any good answers explaining the differences between the two algorithms and why xgboost …
|
Can you answer these questions?
|
Can somebody help make sense of these very odd distributions that I obtained from my trained deep learning regression model? The model was trained with either MAE or MSE loss, which is what the …
|
I am looking for an NLP annotation tool/library that supports active learning. I am looking for something that works in this scenario: Annotating N samples. Training a model on the annotated data. …
|