Is it OK(legitimate approach) to use Generalized Linear Model with Gaussian family distribution to predict discrete quantities by for example rounding the output of Gaussian GLM to the nearest integer?
Using Gaussian family distribution to predict discrete quantities in GLM
797 Views Asked by user1533224 At
1
There are 1 best solutions below
Related Questions in STATISTICS
- How to make pandas show large datasets in output?
- How to construct polygons from a 2D data to compute spatial autocorrelation in R
- Is python statsmodel elastic net regression automatically standard deviation deflated?
- How can I emulate Microsoft Excel's Solver functionality (GRG Nonlinear) in python?
- How do I find the probability that one of my probabilities will occur?
- Timeline-ish data to Occurence/Exposure data
- Handling Error Propagation Above Biological Thresholds in R with predictNLS
- Why is there such a difference between chi2 and mcnemar?
- Handling Nested One-Level Random Effects in Linear Mixed Models in R
- Model failed to converge (gamma model, self-paced reading data)
- How to quantify the consistency of a sequence of predictions, incl. prediction confidence, using standard function from sklearn or a similar library
- P-values for each comparison in a Kruskal post hoc test in R?
- R Metaprop P-value overlapping with forest plot axes
- Monte Carlo simulation Lotto Germany
- How does emmeans adjust the p-values when using "Tukey" as adjustment method? (Solved)
Related Questions in MACHINE-LEARNING
- Trained ML model with the camera module is not giving predictions
- Keras similarity calculation. Enumerating distance between two tensors, which indicates as lists
- How to get content of BLOCK types LAYOUT_TITLE, LAYOUT_SECTION_HEADER and LAYOUT_xx in Textract
- How to predict input parameters from target parameter in a machine learning model?
- The training accuracy and the validation accuracy curves are almost parallel to each other. Is the model overfitting?
- ImportError: cannot import name 'HuggingFaceInferenceAPI' from 'llama_index.llms' (unknown location)
- Which library can replace causal_conv1d in machine learning programming?
- Fine-Tuning Large Language Model on PDFs containing Text and Images
- Sketch Guided Text to Image Generation
- My ICNN doesn't seem to work for any n_hidden
- Optuna Hyperband Algorithm Not Following Expected Model Training Scheme
- How can I resolve this error and work smoothly in deep learning?
- ModuleNotFoundError: No module named 'llama_index.node_parser'
- Difference between model.evaluate and metrics.accuracy_score
- Give Bert an input and ask him to predict. In this input, can Bert apply the first word prediction result to all subsequent predictions?
Related Questions in GAUSSIAN
- Normal Bayes Classification
- Using tft.scale_to_gaussian for preprocessing a dataset without using other tensorflow operations
- How to Implement Back Substitution for Solving Linear Systems in Python?
- A question about odds ratio: Gaussian vs binomial regression 5
- Mahalanobis distance between two multivariate Gaussian distribution
- Clarifying the Process of Generating iid Gaussian Random Vectors in MATLAB
- Assymetric gaussian funciton fitting in Python for time series
- Gauss Method using parallelism in C++
- Attempting a double Gaussian for the data below
- How to minimize the weighted sum of squared errors with bivariate Gaussian model in R in order to estimate model's parameter?
- Diffraction Grating using Fraunhofer diffraction approximation (MATLAB)
- How to compute 2D convolution using 1D convolution over rows and columns?
- Gaussian Pulse generation using python
- Can two Gaussian curves be plotted in different colors on the same plot?
- Problem in applying 2D gaussian method to a raster
Related Questions in GLM
- How do I fix the response: Error in contrasts in R
- How can be the `$ operator is invalid for atomic vectors` error solved?
- Limited Number of Beta Coefficients for 1000 predictors using glm() in R
- Error when trying to fit glmer in R with Poisson distribution: PIRLS step-halvings failed to reduce deviance in pwrssUpdate
- How to extract Correlation of Coefficients table from models like glm?
- How do you fix the pitch/lookAt flip when you rotate pitch past 90 degrees?
- How to force lm() and glm() functions not to refactor weights for linear regression?
- GLM-Small sample size
- F-test and chi-squared test for coefficient significance in sklearn generalized linear models
- GLM Error in R - Getting the message: "Error: no valid set of coefficients has been found: please supply starting values"
- Building Species distribution modells for species within a raster stack
- how do I fix this glmer error in r: PIRLS loop resulted in NaN value
- Robust Poison regression
- How are computed linear predictors in R glm probit?
- Creating Added Variable Plots for GLMM Averaged Models
Related Questions in DISCRETE-SPACE
- Mutate an array of discrete probabilities by excluding one value in C
- Trouble with visualizer.pl
- How to do space discretization in Gekko?
- How to do 1D discrete collision detection as efficiently as possible?
- How to create queue of events in order by date
- Creating discrete-time model
- r/ combinatorial optimization function/package requiring minimal up-front work
- Matlab : Phase space plot
- Sample from a discrete random distribution in Python
- Outlier test in discrete survival analysis
- 100 Point discrete grid in R
- How to make Conditional Probability Tables (CPTs) for Bayesian networks with pymc
- Bezier curves in discrete domain
- Using Gaussian family distribution to predict discrete quantities in GLM
- Matlab's slice() function not working as desired
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
You can do this but it may not be the best thing to do. It really depends on the nature of the data that you are trying to model. It may well be that poission regression is better suited to your needs.
http://en.wikipedia.org/wiki/Poisson_regression
However, there is nothing whatsoever to stop you from actually fitting a linear model to integer valued data but you may have problems when making inference about your data using the model. If you are simply trying to provide a model from which to predict future observations, it may well work nicely even if not theoretically valid.
Clearly given the nature of the model, you may end up predicting utterly ridiculous results - for example, your reponse variable may only make sense over a limited range (say positive integers) but your model could allow prediction of arbitrarily large values (positive AND negative). Model checking steps like residual checking (normality and correlation) may not give the type of results you would normally see when modelling continuous normally distributed responses.
Overall, I would say that depending on your data, your approach COULD generate a useful predictive model but in general you should proceed with caution.
Read this question and some of the answers to it - it discusses similar themes https://stats.stackexchange.com/questions/3024/why-is-poisson-regression-used-for-count-data
To reach a wider audience you might consider posting this question at http://stats.stackexchange.com