SparkR :: gapply How to use LinearRegression across groups in DataFrame?

Lezárva Kiadva: 2 évvel ezelőtt Kiszállításkor fizetve
Lezárva

Hi there

I have big data which I am using for applying linear model to each group. I have small example of the data for the principle I want to have parallelised.

# Determine six waiting times with the largest eruption time in minutes.

schema <- structType(structField("waiting", "double"), structField("max_eruption", "double"))

result <- gapply(

df,

"waiting",

function(key, x) {

y <- [login to view URL](key, max(x$eruptions))

},

schema)

head(collect(arrange(result, "max_eruption", decreasing = TRUE)))

Adatbányászat R Programnyelv

Projektazonosító: #30580205

A projektről

4 ajánlat Távolról teljesíthető projekt Utoljára aktív: 2 évvel ezelőtt

4 szabadúszó tett átlagosan 10€/órás árajánlatot erre a munkára

Annmarie1995

Hi I am a professional statistician with 5 years of experience. I have read the job description. I will help you complete the project. i have skills in Data Mining and R Programming Language. I can deliver quality an Továbbiak

€16 EUR / óra
(23 vélemény)
4.9
WycOj

EXPERT IN STATISTICS Hello there, I am best in statistics, R programming analysis of data, SPSS, Statistical/Data Analysis, Multivariate Statistical Analysis, Regression Analysis, STATA, MINITAB, R language, Factor Ana Továbbiak

€10 EUR / óra
(19 vélemény)
4.4
ibahimakerkouch

Hi, I have a big experience on R programming also I am a master's degree in data science. You can see my profile and my reviews to prove to you that I worked well on R projects. Your project is a challenge for me. Le Továbbiak

€4 EUR / óra
(20 vélemény)
4.3
StatisticandArt

Hi, I graduated Bachelor of Statistics. I have experience using R because that application have been learned when i was college. I am also a specialist in Basic Statistical Analysis (descriptive analysis, graph, chart Továbbiak

€8 EUR / óra
(10 vélemény)
3.2