实验3:常用的数据管理
一:实验目的与要求
1:熟悉和掌握常用的数据管理方法,包括变量重命名、缺失值分析、数据排序、随机抽样、字符串处理、文本分词。
二:实验内容
【创建新变量】
方法1:
|-----------------------------------------------------------------------------------|
| mydata <- iris[,1:2] mydatasquare \<- mydataSepal.Length*mydata$Sepal.Width |
data:image/s3,"s3://crabby-images/4dcaf/4dcaf74faff9a575e76f83f6329829bd803b8b7d" alt=""
方法2:
|-----------------------------------------------------------------------------------------------------------------|
| rm(list=ls()) mydata <- iris[,1:2] attach(mydata) mydata$square <- Sepal.Length*Sepal.Width detach(mydata) |
data:image/s3,"s3://crabby-images/bc51c/bc51c78d0bb20c1de6b17e9aa4b87eecbf5d52a2" alt=""
方法3:使用DT包**(现在已经无法从CRAN下载)**
|----------------------------------------------------------------------------------------------------------------------------------------------|
| rm(list=ls()) mydata <- iris[,1:2] mydata <- transform(mydata,square=Sepal.Length*Sepal.Width) library(DT) datatable(mydata,rownames=F) |
【变量重新编码】
Eg1:
|-----------------------------------------------------------------------------------------------|
| rm(list = ls()) mydata <- mtcars (mydataam \<- ifelse(mydataam == 0,'automatic','manual')) |
data:image/s3,"s3://crabby-images/f6e4a/f6e4af4c3c83b00570f04c5d7677c7edabab54c8" alt=""
Eg2:将99岁这样的年龄重编码为缺失值
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| manager <- c(1,2,3,4,5) date <- c('10/24/08','10/28/08','10/1/08','10/12/08','5/1/09') country <- c('US','US','UK','UK','UK') gender <- c('M','F','F','M','F') age <- c(32,45,25,39,99) q1 <- c(5,3,3,3,2) q2 <- c(4,5,5,3,2) q3 <- c(5,2,5,4,1) q4 <- c(5,5,5,NA,2) q5 <- c(5,5,2,NA,1) leadership <- data.frame(manager,date,country,gender,age,q1,q2,q3,q4,q5, stringsAsFactors = FALSE) leadershipage\[leadershipage==99] <- NA |
data:image/s3,"s3://crabby-images/68151/68151d7ea3e2205a5f8b7ecec8764320970fbf92" alt=""
Eg3:将年龄离散化,将leadership中age(连续型变量)重新编码为类别型变量agecat
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| leadershipagecat\[leadershipage > 66] <- "Elder" leadershipagecat\[leadershipage > 40 & leadershipage\<66\] \<- "Middle Aged" leadershipagecat[leadershipage \>=18 \& leadershipage<=40] <- "Young" |
data:image/s3,"s3://crabby-images/e7161/e7161f05f34e760b50da692092f3fffd439912ff" alt=""
Eg4:同上,但使用within函数
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| leadership <- within(leadership, {agecat <- NA agecat[age > 75] <- 'Elder' agecat[age >-55 & age <=75] <- 'Middle Aged' agecat[age < 55] <- 'Young'}) |
data:image/s3,"s3://crabby-images/10514/10514509e443d35d4cf130299ef716d8a4b7f668" alt=""
【变量的重命名】
Eg1:
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| install.packages("reshape", repos = "http://cran.r-project.org") w <- mtcars library(reshape) colnames(w) w <- rename(w,c(mpg='Miles/(US) gallon',cy1='Number of Cylinders',disp='Displacement(cu.in)',hp='cross horsepower')) colnames(w) names(w)[5] <- 'Rear axle ratio' names(w) |
data:image/s3,"s3://crabby-images/88037/8803743499cc280a41790d5804ed3a289efefc20" alt=""
【数据排序】
Eg1:
|-------------------------------------------------|
| x <- c(19, 84, 64, 2) order(x) rank(x) sort(x) |
data:image/s3,"s3://crabby-images/fa26b/fa26b7e7da4bb92fcfacd991d7028ed67dff8636" alt=""
【随机抽样】
Eg1:
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| install.packages("sampling",repos = "http://cran.r-project.org") library(sampling) LETTERS (s<-srswor(10,26)) (obs<-((1:26)[s!=0])) (sample<-LETTERS[obs]) |
data:image/s3,"s3://crabby-images/ea201/ea201935c4f6ef79e1c44c7bd2e20b1466aa3ebf" alt=""
Eg2:
|------------------------------------------------------------------------------------------------------------------------------------------|
| library(sampling) LETTERS (s<-srswr(10,26)) (obs<-((1:26)[s!=0])) (n<-s[s!=0]) (obs<-rep(obs,times=n)) (sample<-LETTERS[obs]) |
data:image/s3,"s3://crabby-images/173c7/173c77e782979a60560595f9187bc1a12f9b19a4" alt=""
Eg3:
|------------------------------------------------------------------------------------------|
| library(sampling) LETTERS sample(LETTERS,5,replace=TRUE) sample(LETTERS,5,replace=FALSE) |
data:image/s3,"s3://crabby-images/c8bb4/c8bb45135039314fc6f599cbff28a63ba8333459" alt=""
Eg4:
|------------------------------------------------------------------------------------------------------------------------------------|
| library(sampling) LETTERS n<-sample(2,26,replace=T,prob=c(0.7,0.3)) n (sample1 <- LETTERS[n==1]) (sample2 <- LETTERS[n==2]) |
data:image/s3,"s3://crabby-images/2ad9e/2ad9e47c345f214eddd21b6b4183888ba8c20c63" alt=""
【数学函数】
Eg1:
|--------------------------------------------------------------------------------------------------------------|
| x <- c(1.12, -1.234, 3.1, 2.3, -4) abs(x) sqrt(25) ceiling(x) floor(x) round(x,digits=1) signif(x,digits=1) |
data:image/s3,"s3://crabby-images/26281/26281c93a2f0412999fffc389afa92d92a897b5b" alt=""
【统计函数】
方法1:
|---------------------------------------------------------------------------------------------------------------------|
| mean(rivers) median(rivers) sd(rivers) var(rivers) quantile(rivers,c(.3,.84)) range(rivers) min(rivers) max(rivers) |
data:image/s3,"s3://crabby-images/b79c3/b79c34bc972f0deac5f6d9e0db553accac85c3ac" alt=""
【基本函数】
Eg1:鸢尾花数据集操作
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| head(iris) colnames(iris) names(iris) sum(irisSepal.Length) max(irisSepal.Length) min(irisSepal.Length) which.max(irisSepal.Length) which.min(irisSepal.Length) range(irisSepal.Length) |
data:image/s3,"s3://crabby-images/29412/29412bf8a50dedba09e9cb672eee5be07022e1c7" alt=""
【高级函数】
Eg1:鸢尾花数据集操作
|----------------------------------------------------------------------------------------------------------------|
| (x<-rnorm(1)) round(x,2) index<-sample(1:150,30) iris1<-iris$Sepal.Length[index] sort(iris1) order(iris1) |
data:image/s3,"s3://crabby-images/db2ba/db2baf3d458772085a10d2706a64543e98ee3b7c" alt=""
【概率函数】
Eg1:
|----------------------------------------------------------------------|
| data<-rnorm(20) data dnorm(data) pnorm(data) qnorm(0.9,mean=0,sd=1) |
data:image/s3,"s3://crabby-images/f90e6/f90e621f0d94067c65933c6bb620a44caddb0c7b" alt=""
【字符串操作】
Eg1:字符串长度
|-------------------------------|
| data<-'R语言是一门艺术' nchar(data) |
data:image/s3,"s3://crabby-images/de8f5/de8f5c7e9e8323719a304763674716b19f37874a" alt=""
Eg2:字符串合并
|------------------------------------------|
| data1<-'用心去体会' paste(data,data1,sep="") |
data:image/s3,"s3://crabby-images/3c40c/3c40c243d192bfcb60129fa9b670ff70c9aeaade" alt=""
Eg3:paste函数合并
|-----------------------------------------------------------------------------------------------------------------------------------------------------|
| paste("AB",1:5,sep="") x<-list(a="1st",b="2nd",c="3rd") y<-list(d=1,e=2) paste(x,y,sep="-") paste(x,y,sep="-",collapse=";") paste(x,collapse=",") |
data:image/s3,"s3://crabby-images/d9f9f/d9f9ff66872f0055a8540d5d67021be337e3721d" alt=""
Eg4:分割字符串
|---------------------------------------------------------------------------------------|
| data<-c("2016年1月1日","2016年2月1日") strsplit(data,"年") strsplit(data,"年")[[1]][1] |
data:image/s3,"s3://crabby-images/5f8be/5f8bee97e9fe1010a8c5b75f554f3ac23e569ef3" alt=""
Eg5:字符串替换
|---------------------------------------------------------------------------------------------------------------------|
| txt <- c("Whatever","is","worth","doing","is","worth","doing","well") sub("[tr]","k",txt) gsub("[tr]","k",txt) |
data:image/s3,"s3://crabby-images/e974f/e974f17da0d76ebb71ae801e35ffaac66f08bab6" alt=""
Eg6:
|--------------------------------------------------------|
| (x<-'abadfa') (substr(x,2,4)) substr(x,2,4)<-'kkk' x |
data:image/s3,"s3://crabby-images/cfab3/cfab318c3d201c18eff597118eba8ac8df8fe7f6" alt=""
Eg7:
|---------------------------------------------------------------------------------------------------------------------------|
| (x<-'abadfaf') (x<-chartr('k','h',x)) (x<-chartr('faf','tBt',x)) (x<-chartr('faf','ttt',x)) (x<-chartr('af','Tt',x)) |
data:image/s3,"s3://crabby-images/4cfec/4cfec4fb74a6963a9a49732ac83186fae51df021" alt=""
【jiebaR】
Eg1:(由于未找到data.txt文件,因此未对文件进行分词)
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| install.packages("jiebaR",repos = "http://cran.r-project.org") library(jiebaR) mixseg <- worker() mpseg <- worker(type="mp") hmmseg <- worker(type="hmm") word = "人们都说桂林山水甲天下" mixseg <= word mpseg <= word hmmseg <= word segment(word,mixseg) qseg <= word qseg[word] |
data:image/s3,"s3://crabby-images/d351f/d351f61a404ad2eae0613ac9705a97aa1db874f1" alt=""
【apply家族】
Eg1:鸢尾花维度处理
|------------------------------------------------------------------------|
| apply(iris[,1:4],1,mean) sapply(iris,mean) lapply(iris[,1:4],mean) |
data:image/s3,"s3://crabby-images/bbf3e/bbf3e11db23c8e994f0661bd877312b5e94f3db4" alt=""
data:image/s3,"s3://crabby-images/7ddcf/7ddcf6d6dc500df1a66cab118151c32a9b629f1f" alt=""
Eg2:对列表x的每一个元素计算均值
|---------------------------------------------|
| x <- list(a=1:5,b=exp(0:3)) lapply(x,mean) |
data:image/s3,"s3://crabby-images/b90bc/b90bc21823eb186ba2060bcf40221b7bce3cf6ab" alt=""
Eg3:计算矩阵x各行各列的均值
|-----------------------------------------------------------|
| x<-matrix(1:20,ncol=4) x apply(x,1,mean) apply(x,2,mean) |
data:image/s3,"s3://crabby-images/d8107/d8107b872600e12dbf17d35896c7dd122c1e7a0e" alt=""
Eg4:列表list中的元素与数字1-3连接,并以矩阵和列表两种形式输出
|---------------------------------------------------------------------------------------------------------------------|
| list<-list(c("a","b","c"),c("A","B","C")) list sapply(list,paste,1:3,simplify=T) sapply(list,paste,1:3,simplify=F) |
data:image/s3,"s3://crabby-images/b7709/b77099b87438cad0ec72ab102bcf5de620974f7d" alt=""
【数据汇总统计】
Eg1:
|-----------------------------------------------------------------------------------------|
| attach(mtcars) colnames(mtcars) aggregate(mtcars[,c(1,3)],by=list(cyl,gear),FUN=mean) |
data:image/s3,"s3://crabby-images/61c1b/61c1b2270972eea19a4427481f8acb91737707ff" alt=""
【数据融合】
Eg1:melt函数融合数据框和数组
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| library(reshape) head(airquality) air_melt <- melt(airquality,id=c("Ozone","Month","Day"),na.rm=TRUE) head(air_melt) a <- array(c(1:11,NA),c(2,3,2)) a a_melt <- melt(a,na.rm=TRUE,varnames=c("X","Y","Z")) a_melt |
data:image/s3,"s3://crabby-images/e9132/e9132cf7acb4ec8b2b60413810650238190284d7" alt=""
data:image/s3,"s3://crabby-images/6c256/6c256a80bfb9ae572893b0343948a6d14065733f" alt=""
【数据重塑】
Eg1:
|-------------------------------------------------------------------------------------------------------------------------|
| install.packages("reshape2") library(reshape2) air_cast <- dcast(air_melt,Month~variable,fun.aggregate=mean) air_cast |
data:image/s3,"s3://crabby-images/b8fd7/b8fd7d21c3cc4657465f8a1faf09f613af7540d5" alt=""
【粘贴数据结构】
Eg1:数据框加载
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| authors <- data.frame( surname=I(c('Tukey','venables','Tierney','Ripley','McNeil')), nationality=c('US','Australia','US',"UK",'Australia'), deceased=c('yes',rep('no',4)) ) books<-data.frame( name=I(c('Tukey','Venables','Tierney','Ripley','Ripley','McNeil','R core')), title=c('Exploratory Data Analysis', 'Modern Applied statistics...', 'LISP-STAT', 'Spatial Statistics', 'Stochastic Simulation', 'Interactive Data Analysis', 'An Introduction to R'), other.author = c(NA,'Ripley',NA,NA,NA,NA,'Venables & Smi') ) |
data:image/s3,"s3://crabby-images/6e33c/6e33c6e21267004065618e010b3601e702d79ff7" alt=""
Eg2:merge函数
|--------------------------------------------------------------------------------------------------------------------|
| (m1<-merge(authors,books,by.x='surname',by.y='name',all=F)) merge(authors,books,by.x='surname',by.y='name',all=T) |
data:image/s3,"s3://crabby-images/715f1/715f113481c73fc05b20b90ecb6f6572e85a683c" alt=""
【转换函数】
Eg1:transform函数
|--------------------------------------------------------------------------------------------------|
| head(airquality) head(transform(airquality,Ozone=-Ozone)) head(transform(airquality,new=-Ozone)) |
data:image/s3,"s3://crabby-images/7a358/7a358b05da634ecd3ffb86774043dccd6e1303a6" alt=""
【while循环语句】
Eg1:
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| pv <- c(1,1,2,3,1,1,15,7,18) i<-1 result<-"" length(pv) while(i<=length(pv)){ if(pv[i]<=5){ result[i] <- "初级用户" }else if(pv[i]<=15){ result[i] <- "中级用户" }else{ result[i] <- "高级用户" } i <- i+1 } result |
data:image/s3,"s3://crabby-images/e9974/e997418d9e68cb15d25221e63e8672de1e8ac678" alt=""
【for循环】
Eg1:不知道循环次数
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| pv <- c(1,1,2,3,1,1,15,7,18,1,1,2,3,1,1) result<-"" m<-1 for(i in pv){ if(i<=5){ result[m]<-"初级用户"; }else if(i<=15){ result[m]<-"中级用户"; }else{ result[m]<-"高级用户"; } m<-m+1 } result |
data:image/s3,"s3://crabby-images/46841/46841a54e41be6ccb546bbf155595e3b561c9414" alt=""
Eg2:知道循环次数
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| pv <- c(1,1,2,3,1,1,15,7,18,1,1,2,3,1,1) result<-"" for(i in 1:length(pv)){ if(pv[i]<=5){ result[i]<-"初级用户"; }else if(pv[i]<=15){ result[i]<-"中级用户"; }else{ result[i]<-"高级用户"; } } result |
data:image/s3,"s3://crabby-images/1a53c/1a53c7c47454b27ec7893e750a897ab2d7aada8a" alt=""
【repeat-break循环】
Eg1:
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| pv <- c(1,1,2,3,1,1,15,7,18) i<-1 result<-"" repeat{ if(i>length(pv)){ break } if(pv[i]<=5){ result[i]<-"初级用户" }else if(pv[i]<=15){ result[i]<-"中级用户" }else{ result[i]<-"高级用户" } i<-i+1 } result |
data:image/s3,"s3://crabby-images/5d844/5d84439dfa5069a954f3fbef66b427382438c5b7" alt=""
【自定义函数】
Eg1:自定义一个求两数之和的函数
|-------------------------------------------------|
| s <- function(x,y){ a<-x+y return(a) } s(2,3) |
data:image/s3,"s3://crabby-images/09f00/09f00f1d70872685417f03b1b414c39a7cf791ce" alt=""
三:课堂练习
【练习1】PPT-05第27页
案例1:
|------------------------------------------------------------------------------------------------------------------------|
| scale(cars,center=T,scale=F) # centrialize scale(cars,center=T,scale=T) # standarlize diff(cars[,1]) # differentiate |
data:image/s3,"s3://crabby-images/a26f6/a26f6181c57ed47d823fc2b93200bb254ca3e5f1" alt=""
data:image/s3,"s3://crabby-images/cce7c/cce7c0074b0bc839080453c8c396acca15cc73e9" alt=""
案例2:
|------------------------------------------------------------------------------------------------------------------------------|
| date <- c("2016-01-27","2016-02-27") difftime(date[2],date[1],units="days") difftime(date[2],date[1],units="weeks") |
data:image/s3,"s3://crabby-images/10550/1055010645cb79d797edf462b60e426aa6518902" alt=""
【练习2】PPT-05第27页
案例1:subset函数选定特定行或子集
|---------------------------------------------------------|
| subset(airquality, Temp > 80, select = c(Ozone, Temp)) |
data:image/s3,"s3://crabby-images/f2e99/f2e99dc0ab47537986ca47e7a7b1fcce1dd87b9f" alt=""
data:image/s3,"s3://crabby-images/63e84/63e84a86227dd83ebfd304e9bd89839fa4645923" alt=""
data:image/s3,"s3://crabby-images/e3c09/e3c09db169e39e8403cfe24a56de1b379ec27301" alt=""
案例2:
|--------------------------------------------|
| subset(airquality, Day == 1, select=-Temp) |
data:image/s3,"s3://crabby-images/36499/3649943289f54ea256797ef2ac30a35c2092f2d0" alt=""
案例3:
|-----------------------------------------|
| subset(airquality, select = Ozone:Wind) |
data:image/s3,"s3://crabby-images/4dfaa/4dfaa45f0879b4d4fdfad6a343ecaedf2414236d" alt=""
详细结果如下表所示
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Ozone Solar.R Wind 1 41 190 7.4 2 36 118 8.0 3 12 149 12.6 4 18 313 11.5 5 NA NA 14.3 6 28 NA 14.9 7 23 299 8.6 8 19 99 13.8 9 8 19 20.1 10 NA 194 8.6 11 7 NA 6.9 12 16 256 9.7 13 11 290 9.2 14 14 274 10.9 15 18 65 13.2 16 14 334 11.5 17 34 307 12.0 18 6 78 18.4 19 30 322 11.5 20 11 44 9.7 21 1 8 9.7 22 11 320 16.6 23 4 25 9.7 24 32 92 12.0 25 NA 66 16.6 26 NA 266 14.9 27 NA NA 8.0 28 23 13 12.0 29 45 252 14.9 30 115 223 5.7 31 37 279 7.4 32 NA 286 8.6 33 NA 287 9.7 34 NA 242 16.1 35 NA 186 9.2 36 NA 220 8.6 37 NA 264 14.3 38 29 127 9.7 39 NA 273 6.9 40 71 291 13.8 41 39 323 11.5 42 NA 259 10.9 43 NA 250 9.2 44 23 148 8.0 45 NA 332 13.8 46 NA 322 11.5 47 21 191 14.9 48 37 284 20.7 49 20 37 9.2 50 12 120 11.5 51 13 137 10.3 52 NA 150 6.3 53 NA 59 1.7 54 NA 91 4.6 55 NA 250 6.3 56 NA 135 8.0 57 NA 127 8.0 58 NA 47 10.3 59 NA 98 11.5 60 NA 31 14.9 61 NA 138 8.0 62 135 269 4.1 63 49 248 9.2 64 32 236 9.2 65 NA 101 10.9 66 64 175 4.6 67 40 314 10.9 68 77 276 5.1 69 97 267 6.3 70 97 272 5.7 71 85 175 7.4 72 NA 139 8.6 73 10 264 14.3 74 27 175 14.9 75 NA 291 14.9 76 7 48 14.3 77 48 260 6.9 78 35 274 10.3 79 61 285 6.3 80 79 187 5.1 81 63 220 11.5 82 16 7 6.9 83 NA 258 9.7 84 NA 295 11.5 85 80 294 8.6 86 108 223 8.0 87 20 81 8.6 88 52 82 12.0 89 82 213 7.4 90 50 275 7.4 91 64 253 7.4 92 59 254 9.2 93 39 83 6.9 94 9 24 13.8 95 16 77 7.4 96 78 NA 6.9 97 35 NA 7.4 98 66 NA 4.6 99 122 255 4.0 100 89 229 10.3 101 110 207 8.0 102 NA 222 8.6 103 NA 137 11.5 104 44 192 11.5 105 28 273 11.5 106 65 157 9.7 107 NA 64 11.5 108 22 71 10.3 109 59 51 6.3 110 23 115 7.4 111 31 244 10.9 112 44 190 10.3 113 21 259 15.5 114 9 36 14.3 115 NA 255 12.6 116 45 212 9.7 117 168 238 3.4 118 73 215 8.0 119 NA 153 5.7 120 76 203 9.7 121 118 225 2.3 122 84 237 6.3 123 85 188 6.3 124 96 167 6.9 125 78 197 5.1 126 73 183 2.8 127 91 189 4.6 128 47 95 7.4 129 32 92 15.5 130 20 252 10.9 131 23 220 10.3 132 21 230 10.9 133 24 259 9.7 134 44 236 14.9 135 21 259 15.5 136 28 238 6.3 137 9 24 10.9 138 13 112 11.5 139 46 237 6.9 140 18 224 13.8 141 13 27 10.3 142 24 238 10.3 143 16 201 8.0 144 13 238 12.6 145 23 14 9.2 146 36 139 10.3 147 7 49 10.3 148 14 20 16.6 149 30 193 6.9 150 NA 145 13.2 151 14 191 14.3 152 18 131 8.0 153 20 223 11.5 |
【练习3】PPT-05第37页
案例1:求给定向量中的偶数个数
|--------------------------------------------------------------------------------------------------------------|
| Num <- function(x){ k=0 stopifnot(is.numeric(x)) for(i in x){ if(i %% 2==0){ k=k+1 } } return(k) } Num(1:9) |
data:image/s3,"s3://crabby-images/94bb2/94bb25a45e2bc497056b0bef7a80e80f3e5caa03" alt=""
【练习4】PPT-05第39页
案例1:自编函数计算标准差代码
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| BZC <- function(x){ stopifnot(is.numeric(x)) stopifnot(length(x)>1) avg<-mean(x) sum=0 for(i in x){ sum=sum+(i-avg)**2 } return(sqrt(sum/length(x))) } BZC(c(1:5)) |
data:image/s3,"s3://crabby-images/01c2f/01c2f7c0bce2edc2376d9f3b6ea82cc763fdb3c0" alt=""
函数设计流程:
data:image/s3,"s3://crabby-images/81c78/81c78218475643e0ba77c28cdd7e63a3efc18ebf" alt=""
四:实验知识点总结
0:PPT-05上第29页中**【na.rm=FALSE】的作用是------保留数据集中的缺失值,并尝试计算包含这些缺失值的统计量。**当na.rm=TRUE时,函数会在计算统计量之前从数据集中删除所有的缺失值。
1:在变量的重命名中,rename函数可修改数据库和列表(不改变原数据集中的变量名),不能修改矩阵;names函数可修改数据库和列表(改变原数据集中的变量名),不能修改矩阵;colnames函数和rownames函数可修改矩阵和数据库的行名和列名。
2:数据排序函数的区别。
data:image/s3,"s3://crabby-images/3b130/3b13029324974f9fe9895632a0913d6625b8f9cf" alt=""
3:随机抽样函数的区别。
data:image/s3,"s3://crabby-images/ced0b/ced0bbd4c12b988077c70adf98693e29be058f87" alt=""
4:常用的数学函数。
data:image/s3,"s3://crabby-images/54743/5474363231eac3e93c900fde7fd508f84e2ecf6d" alt=""
5:常用的统计函数。
data:image/s3,"s3://crabby-images/a9f54/a9f54e16f88ac4183a241d00751f90a4b060b32e" alt=""
6:常用的高级数学函数。
data:image/s3,"s3://crabby-images/cb7a5/cb7a5f65bec309f29be63b18071a24f34ac46515" alt=""
7:apply家族的使用方法。
data:image/s3,"s3://crabby-images/7e668/7e668f3be02972593c05da753336820cb493392b" alt=""
8:函数体的组成部分。
data:image/s3,"s3://crabby-images/10c5c/10c5c0679f9e0e1f94abd33ef0819d2795176ab1" alt=""
五:遇到的问题和解决方法
**问题&解决1:**reshape2和reshape不是一个依赖包,只载入reshape包而不载入reshape2包,无法调用reshape2中的封装功能。因此,需要看清楚封装功能所来源的依赖包是哪一个,再进行调用。
**问题&解决2:**如果当前使用的镜像源太卡,会导致install.packages()失败。此时需要使用【chooseCRANmirror()】,并选择新的镜像源,然后再进行依赖包的安装和载入。或者通过例如【options(repos = c(CRAN = "https://cloud.r-project.org/"))】的方式更换。
**问题&解决3:**PPT中的思考题通过以下网页进行学习。