Download the Main SOCR Wiki Page and compare RCurl and httr.
Read and write XML for SOCR Main Page.
SOCR Main Page Data Scraping.
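The parse, scrape, and write-back steps in the tasks above can be sketched as follows. This is a minimal illustration that assumes the xml2 package and uses a small inline HTML string (a hypothetical stand-in, not the SOCR page) so that it runs without network access. With a connection, the page text could instead be fetched with httr::GET() or RCurl::getURL().

```r
library(xml2)

# Hypothetical stand-in for a downloaded page (NOT the real SOCR content).
page_text <- '<html><head><title>Demo page</title></head>
<body><h1>Resources</h1>
<a href="/data">Data</a> <a href="/tools">Tools</a></body></html>'

# Parse the markup into a document tree.
doc <- read_html(page_text)

# Scrape: extract the page title and every hyperlink (label and target).
page_title <- xml_text(xml_find_first(doc, "//title"))
links <- xml_find_all(doc, "//a")
link_table <- data.frame(
  text = xml_text(links),
  href = xml_attr(links, "href"),
  stringsAsFactors = FALSE
)

# Write the parsed tree to disk and read it back.
out_file <- tempfile(fileext = ".html")
write_html(doc, out_file)
reloaded <- read_html(out_file)
reloaded_title <- xml_text(xml_find_first(reloaded, "//title"))

print(link_table)
```

The same XPath queries should carry over to a real page once `page_text` is replaced by downloaded content; only the expressions selecting the nodes of interest would need adjusting.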
Download 03_les miserablese_GraphData.txt
Visualize this undirected network.
Summarize the graph and explain the output.
Calculate the degree and centrality measures of this graph.
Identify some important characters.
Will the results change if we assume the graph is directed?
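One way to approach the network tasks above is sketched below, assuming the igraph package. The edge list is a hypothetical toy example, not the contents of the Les Miserables file; with the real data, the edge list would be read from the downloaded text file first.

```r
library(igraph)

# Hypothetical toy edge list standing in for the real graph data.
edges <- data.frame(
  from = c("Valjean", "Valjean", "Valjean", "Javert", "Cosette"),
  to   = c("Javert", "Cosette", "Fantine", "Fantine", "Marius"),
  stringsAsFactors = FALSE
)

# Build the same edges as an undirected and as a directed graph.
g_undirected <- graph_from_data_frame(edges, directed = FALSE)
g_directed   <- graph_from_data_frame(edges, directed = TRUE)

# Visualize the undirected network.
plot(g_undirected, vertex.label.cex = 0.8)

# Degree and betweenness centrality for the undirected graph.
deg_all <- degree(g_undirected)
btw     <- betweenness(g_undirected)

# In a directed graph, degree splits into in-degree and out-degree.
deg_in  <- degree(g_directed, mode = "in")
deg_out <- degree(g_directed, mode = "out")

# The vertex with the most connections is one candidate "important character".
hub <- names(which.max(deg_all))

print(deg_all)
print(rbind(deg_in, deg_out))
```

Comparing `deg_all` with `deg_in` and `deg_out` shows why the answer to the directed-graph question depends on the measure: total degree is unchanged, but in-degree and out-degree (and path-based measures such as betweenness) can differ once edge direction matters.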
Download CaseStudy12_ AdultsHeartAttack_Data.xlsx or load it directly from its online location.
Load this data as a data frame.
Use export() or write.xlsx() to re-save the xlsx file.
Use the rio package to convert this “.xlsx” file to “.csv”.
Convert the data into a generalized tabular data structure.
Generate a data.table.
Create disk-based data frames and perform basic calculations.
Perform basic calculations on the last 5 columns as a big matrix.
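The export, conversion, and data.table steps above might look like the following sketch, which assumes the rio and data.table packages. The data frame is a hypothetical miniature that borrows a few column names from the exercise; it is not the case-study data, and the disk-based and big-matrix tasks are not covered here.

```r
library(rio)
library(data.table)

# Hypothetical miniature data set (invented values, for illustration only).
df <- data.frame(
  SEX     = c("Female", "Male", "Female", "Male"),
  AGE     = c(71, 64, 80, 58),
  LOS     = c(5, 3, 9, 2),
  CHARGES = c(4752, 3941, 10958, 3657),
  stringsAsFactors = FALSE
)

# Save the data frame as an xlsx file.
xlsx_file <- file.path(tempdir(), "heart_demo.xlsx")
export(df, xlsx_file)

# Convert the xlsx file to csv in a single call.
csv_file <- file.path(tempdir(), "heart_demo.csv")
convert(xlsx_file, csv_file)

# Load the csv as a data.table and run a grouped summary.
dt <- fread(csv_file)
by_sex <- dt[, .(mean_los = mean(LOS), n = .N), by = SEX]

print(by_sex)
```

rio picks the file format from the extension, which is why the same `export()` and `convert()` calls work for both file types.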
Use DIAGNOSIS, SEX, DRG, CHARGES, LOS and AGE to predict DIED with randomForest, setting ntree=20000. Note: sample without replacement to obtain a balanced dataset that is as large as possible.
Run train() in caret and measure the execution time.
Detect the number of cores and create a cluster with an appropriate number of workers.
Rerun train() in parallel and compare the execution times.
Use foreach and doMC to design a parallelized random forest with ntree=20000 in total and compare the execution time with sequential execution.
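The last task can be sketched by splitting the trees across workers and merging the partial forests. This is a sketch under several stated assumptions, not a reference solution: it uses doParallel as a portable stand-in for doMC (which is Unix-only), synthetic data instead of the case-study file, and 400 trees rather than 20000 so that it finishes quickly. The caret train() tasks are not covered.

```r
library(randomForest)
library(foreach)
library(doParallel)

set.seed(1234)

# Hypothetical synthetic data; the column names only mimic the exercise.
n <- 400
dat <- data.frame(
  AGE     = rnorm(n, 65, 10),
  LOS     = rpois(n, 5),
  CHARGES = rlnorm(n, 8, 0.5)
)
dat$DIED <- factor(rbinom(n, 1, plogis((dat$AGE - 65) / 10)))

total_trees <- 400  # the exercise asks for 20000; reduced here for speed

# Sequential baseline.
t_seq <- system.time(
  rf_seq <- randomForest(DIED ~ ., data = dat, ntree = total_trees)
)

# Detect cores, leave one free, and cap the worker count at 4 (arbitrary cap).
n_cores   <- parallel::detectCores()
n_workers <- if (is.na(n_cores)) 1 else max(1, n_cores - 1)
n_workers <- min(n_workers, 4)

cl <- parallel::makeCluster(n_workers)
registerDoParallel(cl)

# Split the trees across workers; the first worker takes any remainder.
trees_per_worker <- rep(total_trees %/% n_workers, n_workers)
trees_per_worker[1] <- trees_per_worker[1] + total_trees %% n_workers

# Grow the partial forests in parallel and merge them into one forest.
t_par <- system.time(
  rf_par <- foreach(nt = trees_per_worker,
                    .combine = randomForest::combine,
                    .packages = "randomForest") %dopar% {
    randomForest(DIED ~ ., data = dat, ntree = nt)
  }
)

parallel::stopCluster(cl)

print(rbind(sequential = t_seq[["elapsed"]], parallel = t_par[["elapsed"]]))
```

On a small problem like this one, the cost of starting the cluster can outweigh the gain, so the parallel run is not guaranteed to be faster; the benefit is expected to show up as the number of trees and rows grows. On Unix-like systems, `doMC::registerDoMC(n_workers)` could replace the cluster setup and teardown.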