I have a function that I'm applying to different sets of coordinates to create four new columns in my tibble. This function has a pretty long start-up time (loads the genome into RAM, converts tibble to GRanges, and retrieves sequences) but is relatively fast, so that there's not much difference between 100 and 1,000,000 sequences. Is there any way to send each col in the mutate to a different core so they can be processed at the same time? I thought about using pivot_long and then group+partition but this got me thinking about whether there was a different way to accomplish this. A multi_mutate of sorts?
(I don't actually expect the multiplyr partition/collect to be that time-saving in my case given the small cost to additional coordinates, but if I could avoid the time cost of pivoting, which is still relatively small, and mess in my code, that'd be cool.)
Send different dplyr::mutate cols to different cores with multdplyr?
208 Views Asked by GenesRus At
1
There are 1 best solutions below
Related Questions in R
- How to make an R Shiny app with big data?
- How do I keep only specific rows based on whether a column has a specific value?
- Likert scale study - ordinal regression model
- Extract a table/matrix from R into Excel with same colors and stle
- How can I solve non-conformable arguments in R netmeta::discomb (Error in B.matrix %*% C.matrix)?
- Can raw means and estimated marginal means be the same ? And when?
- Understanding accumulate function when .dir is set to "backwards"
- Error in if (nrow(peaks) > 0) { : argument is of length zero Calls: CopywriteR ... tryCatch -> tryCatchList -> tryCatchOne -> <Anonymous> Execution ha
- How to increase quality of mathjax output?
- Convert the time intervals to equal hours and fill in the value column
- How to run an R function getpoints() from IPDfromKM package in an R shiny app which in R pops up a plot that utilizes clicks to capture coordinates?
- Replace NA in list of dfs in certain columns and under certain conditions
- R and text on Cyrillic
- The ts() function in R is returning the correct start and frequency but not end value which is 1 and not 179
- TROUBLING with the "DROP_NA" Function
Related Questions in DPLYR
- Convert the time intervals to equal hours and fill in the value column
- Subsetting rows with sequence of values and identifying columns where sequence begins
- How to change the order of rows?
- Re-arrange rows by longest interval between starttime and endtime
- Mutate based on a condition
- How to iteratively create matrices/vectors from columns/unique row values of dataframe, and pass them to subsequent code?
- In R, how to place error bars at each bar which is stacked, particularly when using facet_grid?
- filter() function not working within the for loop
- matching metadata on multiple nested data frames
- Write custom lazy evaluation function like dbplyr to get SQL
- Why can't I cut a buffer, both in R and QGIS?
- I want to summarize a huge data frame in R in such a way that I only need unique "lat", "lon", "Date (Year)" and "Maxium Value"
- Alternatives for distinct(.keep_all = TRUE) in arrow?
- sparklyr group by mutate with n_distinct
- Order rows by type of character
Related Questions in PARALLEL-PROCESSING
- How to calculate Matrix exponential with Tailor series PARALLEL using MPI c++
- Efficiently processing many small elements of a collection concurrently in Java
- Parallelize filling of Eigen Matrix in C++
- Memory efficient parallel repeated rarefaction with subsequent matrix addition of large data set
- How to publish messages to RabbitMQ by using Multi threading?
- Running a C++ Program with CMake, MPI and OpenCV
- Alternative approach to io.ReadAll to store memory consumption and send a PUT Request with valid data
- Parallelize nested loop with running sum in Fortran
- Can I use parfor within a parfeval in Matlab R2019b and if yes how?
- Parallel testing with cucumber, selenium and junit 5
- Parallel.ForEach vs ActionBlock
- Passing variable to foreach-object -parallel which is with in start-job
- dbatools SQL Functions Not Running In Parallel While SQL Server queries do in Powershell
- How do I run multiple instances of my Powershell function in parallel?
- Joblib.parallel vs concurrent.futures
Related Questions in MULTIDPLYR
- Error with rep using multidplyr: cannot find function "n"
- Creating a frequency 2x2 table in R but replacing frequency data with numerical data from another variable
- How to predict values using random forest model and multidplyr packages in parallel processing mode?
- multidplyr within a for loop- warning: closing unused connections
- How do I amend missing values to 0 when using full_join function in R?
- How can I use apply/lapply on each row of a data frame where calculation requires fetching data from another data frame
- How can I add additional years to my dataset using multidplyr or parallel processing?
- Error: function 'Rcpp_precious_remove' not provided by package 'Rcpp'
- Combine dtplyr and multidplyr to deal with large mutate operation
- Why is left_join creating NAs when values seem to match in the by="x" argument?
- How to set time out in multidplyr
- how to merge two data frame by rows of x and y but columns should be side (df1$x) by side (df2$y)?
- Parallel processing with a function that uses parallel processing?
- merge multiple table with different length and form a single table in R
- Error in is.data.frame(.l) : object 'group' not found
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
I know you were looking for an existing package, but I couldn't find anything on that. Other similar questions (like here or here) appear not to provide a package either..
However, what about you hack it out yourself... Look at this example with
furrr.It needs some testing a guess.. and It would need to be improved.. for example using the same methods available for
mutate. But it's a start.Notice that I need to use
future_options..