Installation of R packages¶

In [1]:

#install.packages("ISwR")

Package loading¶

In [2]:

library(ISwR)

Variable definition and assignment¶

In [3]:

weight <- 60
height = 1.75
subject <- "A"
healthy <- TRUE

Variable evaluation¶

In [4]:

weight

60

Functions for type checking¶

In [5]:

is.numeric(weight) # variable 
is.double(weight)
is.integer(weight)
is.character(subject)

TRUE

FALSE

TRUE

Functions for variable conversion¶

In [6]:

weight <- as.integer(weight)
is.integer(weight)

TRUE

Computing the body mass index (BMI) from the weight and height¶

In [7]:

#Body mass index (BMI)
bmi <- weight/height^2 
bmi

19.5918367346939

Functions for string manipulation¶

In [8]:

message <- sprintf("%.1f", bmi)
print(message)

[1] "19.6"

Vector definition¶

In [9]:

weight <- c(60, 72, 57, 90, 95, 72) 
height <- c(1.75, 1.80, 1.65, 1.90, 1.74, 1.91)
subject <- c("A", "B", "C", "D", "E", "F")

Vector evaluation¶

In [10]:

weight
height
subject

60
72
57
90
95
72

1.75
1.8
1.65
1.9
1.74
1.91

'A'
'B'
'C'
'D'
'E'
'F'

Creating a vector with a particular size¶

In [11]:

vec <- rep(0, 10)
vec

0
0
0
0
0
0
0
0
0
0

Vector length¶

In [12]:

length(weight)

6

Vector indexes: from one to the length of the vector¶

In [13]:

weight[1]
weight[length(weight)]

60

72

Iteration: for loop¶

from one to the length of weight

In [14]:

bmi <- 0
for (i in 1:length(weight)) {
  bmi[i] <- weight[i]/height[i]^2
}

evaluation of the bmi vector

In [15]:

bmi

19.5918367346939
22.2222222222222
20.9366391184573
24.9307479224377
31.3779891663364
19.7363010882377

Iteration: while loop¶

run while i is below or equal to the length of weight

In [16]:

bmi <- 0
i <- 1
while (i <= length(weight)) {
  bmi[i] <- weight[i]/height[i]^2
  i <- i + 1
}

In [17]:

bmi

19.5918367346939
22.2222222222222
20.9366391184573
24.9307479224377
31.3779891663364
19.7363010882377

Remove a variable¶

In [18]:

rm(bmi)
exists("bmi")

FALSE

Right way of manipulating vectors: assigning at once¶

In [19]:

bmi <- weight/height^2 
bmi

19.5918367346939
22.2222222222222
20.9366391184573
24.9307479224377
31.3779891663364
19.7363010882377

Creating a function¶

name <- function(parameters) { body }

In [20]:

compute_bmi <- function(weight, height) {
  bmi <- weight/height^2 
  return(bmi)
}

Using a function with scalars¶

In [21]:

bmi <- compute_bmi(60, 1.75)
bmi

19.5918367346939

Using the same function with vectors¶

In [22]:

bmi <- compute_bmi(weight, height)
bmi

19.5918367346939
22.2222222222222
20.9366391184573
24.9307479224377
31.3779891663364
19.7363010882377

Example of a function to compute the average¶

(iterating in all elements of the vector)

In [23]:

average <- function(vec) {
    s <- 0
    n <- length(vec)
    for (x in vec) {
      s <- s + x  
    }
    return(s/n)
}

invoking the function

In [24]:

avg_bmi <- average(bmi)
avg_bmi

23.1326227087309

Example of a function to compute the average¶

(manipulating vectors at once)

In [25]:

average <- function(vec) {
    s <- sum(vec)
    n <- length(vec)
    return(s/n)
}

invoking the function

In [26]:

avg_bmi <- average(bmi)
avg_bmi

23.1326227087309

Average function using mean function¶

Major statistical functions are available in R

In [27]:

average <- function(vec) {
    return(mean(vec))
}

invoking the function

In [28]:

avg_bmi <- average(bmi)
avg_bmi

23.1326227087309

Working with vectors with NA¶

Operations with NA lead to NA.

In [29]:

x <- c(10, NA, 13)
y <- average(x)
y

<NA>

addressing NA with na.rm=TRUE¶

In [30]:

average <- function(vec) {
    return(mean(vec, na.rm=TRUE))
}

In [31]:

x <- c(10, NA, 13)
y <- average(x)
y

11.5

Plotting graphics¶

scatter plots

In [32]:

plot(height, weight)

Most functions contain many default parameters¶

In [33]:

plot(height, weight, pch=2)

Default function arguments can be shown with args¶

In [34]:

args(plot.default)

function (x, y = NULL, type = "p", xlim = NULL, ylim = NULL, 
    log = "", main = NULL, sub = NULL, xlab = NULL, ylab = NULL, 
    ann = par("ann"), axes = TRUE, frame.plot = axes, panel.first = NULL, 
    panel.last = NULL, asp = NA, xgap.axis = NA, ygap.axis = NA, 
    ...) 
NULL

All functions in R that belongs to packages have help with examples¶

In [35]:

?base::plot

Canvas for plotting is still active until a new plot¶

In [36]:

plot(height, weight)
hh = c(1.65, 1.70, 1.75, 1.80, 1.85, 1.90)
lines(hh, 22.5 * hh^2)

Factors¶

Factors are used to handle categorical data.

In [37]:

pain <- c(0,3,2,2,1)
fpain <- factor(pain,levels=0:3, ordered=TRUE)
fpain

0
3
2
2
1

Levels:

'0'
'1'
'2'
'3'

Levels provide correspondence between numerical values and categorical labels¶

In [38]:

levels(fpain) <- c("none","mild","medium","severe")
fpain

none
severe
medium
medium
mild

Levels:

'none'
'mild'
'medium'
'severe'

Convert height to factor¶

Levels: small, medium, high

coding setting element by element¶

In [39]:

lev <- rep("", length(height))

for (i in 1:length(height)) {
  if (height[i] < 1.7)
    lev[i] <- "short"
  else if (height[i] < 1.9)
    lev[i] <- "medium"
  else 
    lev[i] <- "tall"
}
lev <- as.factor(lev)
lev

medium
medium
short
tall
medium
tall

Levels:

'medium'
'short'
'tall'

coding setting the vector at once¶

It uses the cut function.

In [40]:

lev <- cut(height, breaks=c(0, 1.7, 1.9, .Machine$double.xmax), ordered=TRUE)
lev
levels(lev) <- c("short", "medium", "tall")
lev

(1.7,1.9]
(1.7,1.9]
(0,1.7]
(1.7,1.9]
(1.7,1.9]
(1.9,1.8e+308]

Levels:

'(0,1.7]'
'(1.7,1.9]'
'(1.9,1.8e+308]'

medium
medium
short
medium
medium
tall

Levels:

'short'
'medium'
'tall'

Matrix¶

Matrices can be filled from vectors or data frames.

In [41]:

x <- 1:9
x

1
2
3
4
5
6
7
8
9

Converting a vector to matrix¶

In [42]:

dim(x) <- c(3,3)
x

A matrix: 3 × 3 of type int
1	4	7
2	5	8
3	6	9

accessing elements from a matrix¶

In [43]:

for (i in 1:nrow(x)) 
    for (j in 1:ncol(x))
        print(x[i,j])

[1] 1
[1] 4
[1] 7
[1] 2
[1] 5
[1] 8
[1] 3
[1] 6
[1] 9

Iterating and assigning values to each element¶

In [44]:

y <- x
for (i in 1:nrow(y)) 
    for (j in 1:ncol(y))
        y[i,j] <- 3 * y[i, j]
    
y

A matrix: 3 × 3 of type dbl
3	12	21
6	15	24
9	18	27

Assigning the values of a matrix at once¶

In [45]:

y <- 3*x
y

A matrix: 3 × 3 of type dbl
3	12	21
6	15	24
9	18	27

Converting a vector to a matrix by row¶

In [46]:

x <- matrix(1:9,nrow=3,byrow=T)
x

A matrix: 3 × 3 of type int
1	2	3
4	5	6
7	8	9

transposing a matrix¶

In [47]:

x <- t(x)
x

A matrix: 3 × 3 of type int
1	4	7
2	5	8
3	6	9

computing the determinant of a matrix¶

In [48]:

det(x)

0

Lists¶

Lists are used to work with "objects"

In [49]:

a <- c(5260,5470,5640,6180,6390,6515,6805,7515,7515,8230,8770)
b <- c(3910,4220,3885,5160,5645,4680,5265,5975,6790,6900,7335)

mybag <- list(a, b, 0, "a")
mybag

1. 5260
2. 5470
3. 5640
4. 6180
5. 6390
6. 6515
7. 6805
8. 7515
9. 7515
10. 8230
11. 8770
1. 3910
2. 4220
3. 3885
4. 5160
5. 5645
6. 4680
7. 5265
8. 5975
9. 6790
10. 6900
11. 7335
0
'a'

adding an element into a list

In [50]:

n <- length(mybag)
mybag[[n+1]] <- "b"
mybag

1. 5260
2. 5470
3. 5640
4. 6180
5. 6390
6. 6515
7. 6805
8. 7515
9. 7515
10. 8230
11. 8770
1. 3910
2. 4220
3. 3885
4. 5160
5. 5645
6. 4680
7. 5265
8. 5975
9. 6790
10. 6900
11. 7335
0
'a'
'b'

List slicing¶

In [51]:

slice <- mybag[1]
slice
is.list(slice)

1. 5260
2. 5470
3. 5640
4. 6180
5. 6390
6. 6515
7. 6805
8. 7515
9. 7515
10. 8230
11. 8770

TRUE

Slicing is also a list¶

In [52]:

slice <- mybag[c(1,3)]
slice
is.list(slice)

1. 5260
2. 5470
3. 5640
4. 6180
5. 6390
6. 6515
7. 6805
8. 7515
9. 7515
10. 8230
11. 8770
0

TRUE

A list is also a vector¶

In [53]:

#list is also a vector
is.vector(slice)

TRUE

Member reference¶

It accesses the element

In [54]:

h <- mybag[[1]]
h

5260
5470
5640
6180
6390
6515
6805
7515
7515
8230
8770

An element can be evaluated. In this case, it is a vector.

In [55]:

is.vector(h)
is.list(h)

TRUE

FALSE

Naming variables¶

They are properties on the list

In [56]:

mybag <- list(x=a, y=b, const=0, lit="a")
mybag

$x

5260
5470
5640
6180
6390
6515
6805
7515
7515
8230
8770

$y

3910
4220
3885
5160
5645
4680
5265
5975
6790
6900
7335

$const

0

$lit

'a'

Adding, accessing, and removing elements¶

In [57]:

mybag$c <- mybag$x - mybag$y
mybag$const <- NULL
mybag$lit <- NULL
mybag

$x

5260
5470
5640
6180
6390
6515
6805
7515
7515
8230
8770

$y

3910
4220
3885
5160
5645
4680
5265
5975
6790
6900
7335

$c

1350
1250
1755
1020
745
1835
1540
1540
725
1330
1435

Data frames¶

Data frames (tables) provide support for structured data.

In [58]:

d <- data.frame(A=a, B=b)
head(d)

A data.frame: 6 × 2
	A	B
	<dbl>	<dbl>
1	5260	3910
2	5470	4220
3	5640	3885
4	6180	5160
5	6390	5645
6	6515	4680

Adding a column in the data frame¶

In [59]:

d$c <- d$A + d$B
head(d)

A data.frame: 6 × 3
	A	B	c
	<dbl>	<dbl>	<dbl>
1	5260	3910	9170
2	5470	4220	9690
3	5640	3885	9525
4	6180	5160	11340
5	6390	5645	12035
6	6515	4680	11195

In [60]:

d$A <- NULL
head(d)

A data.frame: 6 × 2
	B	c
	<dbl>	<dbl>
1	3910	9170
2	4220	9690
3	3885	9525
4	5160	11340
5	5645	12035
6	4680	11195

Reading csv file¶

There are many functions for reading CSV, Excel, and RData formats.

In [61]:

wine = read.table(
    "http://archive.ics.uci.edu/ml/machine-learning-databases/wine/wine.data", 
                  header = TRUE, sep = ",")
  colnames(wine) <- c('Type', 'Alcohol', 'Malic', 'Ash', 
                      'Alcalinity', 'Magnesium', 'Phenols', 
                      'Flavanoids', 'Nonflavanoids',
                      'Proanthocyanins', 'Color', 'Hue', 
                      'Dilution', 'Proline')
head(wine)

A data.frame: 6 × 14
	Type	Alcohol	Malic	Ash	Alcalinity	Magnesium	Phenols	Flavanoids	Nonflavanoids	Proanthocyanins	Color	Hue	Dilution	Proline
	<int>	<dbl>	<dbl>	<dbl>	<dbl>	<int>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<int>
1	1	13.20	1.78	2.14	11.2	100	2.65	2.76	0.26	1.28	4.38	1.05	3.40	1050
2	1	13.16	2.36	2.67	18.6	101	2.80	3.24	0.30	2.81	5.68	1.03	3.17	1185
3	1	14.37	1.95	2.50	16.8	113	3.85	3.49	0.24	2.18	7.80	0.86	3.45	1480
4	1	13.24	2.59	2.87	21.0	118	2.80	2.69	0.39	1.82	4.32	1.04	2.93	735
5	1	14.20	1.76	2.45	15.2	112	3.27	3.39	0.34	1.97	6.75	1.05	2.85	1450
6	1	14.39	1.87	2.45	14.6	96	2.50	2.52	0.30	1.98	5.25	1.02	3.58	1290

saving in binary format¶

In [62]:

save(wine, file="wine.RData")

removing data frame from memory¶

In [63]:

rm(wine)

load binary format¶

In [64]:

load("wine.RData")
head(wine, 3)

A data.frame: 3 × 14
	Type	Alcohol	Malic	Ash	Alcalinity	Magnesium	Phenols	Flavanoids	Nonflavanoids	Proanthocyanins	Color	Hue	Dilution	Proline
	<int>	<dbl>	<dbl>	<dbl>	<dbl>	<int>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<dbl>	<int>
1	1	13.20	1.78	2.14	11.2	100	2.65	2.76	0.26	1.28	4.38	1.05	3.40	1050
2	1	13.16	2.36	2.67	18.6	101	2.80	3.24	0.30	2.81	5.68	1.03	3.17	1185
3	1	14.37	1.95	2.50	16.8	113	3.85	3.49	0.24	2.18	7.80	0.86	3.45	1480

exporting data.frame into csv file¶

In [65]:

write.table(wine, file="wine.csv", row.names=FALSE, quote = FALSE, sep = ",")

filtering vectors¶

In [66]:

a <- c(5260,5470,5640,6180,6390,6515,6805,7515,7515,8230,8770)
b <- c(3910,4220,3885,5160,5645,4680,5265,5975,6790,6900,7335)

# logical vector
bool <- (a > 7000)
bool

FALSE
FALSE
FALSE
FALSE
FALSE
FALSE
FALSE
TRUE
TRUE
TRUE
TRUE

In [67]:

# selecting elements from positions that are true
a[bool]

7515
7515
8230
8770

In [68]:

# filtering with logical expressions
b[a < 6000 | a > 7000]

3910
4220
3885
5975
6790
6900
7335

In [69]:

b[6000 <= a & a <= 7000]

5160
5645
4680
5265

filtering data frames¶

In [70]:

data <- data.frame(a=a, b=b)
data$c <- data$a - data$b
head(data, nrow(data))

A data.frame: 11 × 3
	a	b	c
	<dbl>	<dbl>	<dbl>
1	5260	3910	1350
2	5470	4220	1250
3	5640	3885	1755
4	6180	5160	1020
5	6390	5645	745
6	6515	4680	1835
7	6805	5265	1540
8	7515	5975	1540
9	7515	6790	725
10	8230	6900	1330
11	8770	7335	1435

In [71]:

head(data[data$a > 7000,])

A data.frame: 4 × 3
	a	b	c
	<dbl>	<dbl>	<dbl>
8	7515	5975	1540
9	7515	6790	725
10	8230	6900	1330
11	8770	7335	1435

In [72]:

head(data[data$a > 7000,c(1,2)])

A data.frame: 4 × 2
	a	b
	<dbl>	<dbl>
8	7515	5975
9	7515	6790
10	8230	6900
11	8770	7335

performance with matrix and data frames¶

In [73]:

rheight <- rnorm(100000, 1.8, sd=0.2)
rweight <- rnorm(100000, 72, sd=15)

computing a entire column at once¶

In [74]:

start_time <- Sys.time()
hw <- data.frame(height=rheight, weight=rweight)
hw$bmi <- hw$weight/hw$height^2
end_time <- Sys.time()
end_time - start_time
object.size(hw)

Time difference of 0.002118826 secs

2400984 bytes

processing cell by cell¶

In [75]:

start_time <- Sys.time()
hw <- data.frame(height=rheight, weight=rweight)
for (i in 1:nrow(hw)) {
  hw$bmi[i] <- hw$weight[i]/hw$height[i]^2
}
end_time <- Sys.time()
end_time - start_time

Time difference of 4.172815 secs

convert the entire column¶

In [76]:

start_time <- Sys.time()
hw <- data.frame(height=rheight, weight=rweight)
hw <- as.matrix(hw)
hw <- cbind(hw, 0)
for (i in 1:nrow(hw)) {
  hw[i,3] <- hw[i,2]/hw[i,1]^2
}
end_time <- Sys.time()
end_time - start_time

Time difference of 0.14888 secs

apply family¶

apply functions can be applied for all rows or columns.

The first character of the function name establishes the return type (s: simple, l: list).

In [77]:

library(ISwR)
data(thuesen)
head(thuesen)

A data.frame: 6 × 2
	blood.glucose	short.velocity
	<dbl>	<dbl>
1	15.3	1.76
2	10.8	1.34
3	8.1	1.27
4	19.5	1.47
5	7.2	1.27
6	5.3	1.49

In [78]:

#lapply returns a list
lapply(thuesen, mean, na.rm=T)

$blood.glucose: 10.3
$short.velocity: 1.32565217391304

In [79]:

#sapply returns a vector
sapply(thuesen, mean, na.rm=T)

blood.glucose: 10.3
short.velocity: 1.32565217391304

In [80]:

# apply - second parameter (1: by rows, 2: by columns)
m <- as.matrix(thuesen)
apply(m, 1, min, na.rm=TRUE)
apply(m, 2, min, na.rm=TRUE)

1.76
1.34
1.27
1.47
1.27
1.49
1.31
1.09
1.18
1.22
1.25
1.19
1.95
1.28
1.52
8.6
1.12
1.37
1.19
1.05
1.32
1.03
1.12
1.7

blood.glucose: 4.2
short.velocity: 1.03

sort and order¶

In [81]:

library(ISwR)
data(thuesen)
head(thuesen)

A data.frame: 6 × 2
	blood.glucose	short.velocity
	<dbl>	<dbl>
1	15.3	1.76
2	10.8	1.34
3	8.1	1.27
4	19.5	1.47
5	7.2	1.27
6	5.3	1.49

In [82]:

sort(thuesen$blood.glucose)

4.2
4.9
5.2
5.3
6.7
6.7
7.2
7.5
8.1
8.6
8.8
9.3
9.5
10.3
10.8
11.1
12.2
12.5
13.3
15.1
15.3
16.1
19
19.5

In [83]:

order(thuesen$blood.glucose)

17
22
12
6
11
15
5
9
3
16
23
7
24
18
2
8
10
19
21
14
1
20
13
4

In [84]:

o <- order(thuesen$blood.glucose)
sorted <- thuesen[o,]
head(sorted)

A data.frame: 6 × 2
	blood.glucose	short.velocity
	<dbl>	<dbl>
17	4.2	1.12
22	4.9	1.03
12	5.2	1.19
6	5.3	1.49
11	6.7	1.25
15	6.7	1.52

Pipelines¶

The operator $\%$>$\%$ creates a pipeline.

The first parameter of the next invoked function receives the data from the pipeline.

Library $dplyr$ contains a set of functions that support relational algebra operations.

In [85]:

flight_data <- read.table(text = "Year Quarter Flights Delays
                     2016 1 11 6
                     2016 2 12 5
                     2016 3 13 3
                     2016 4 12 5
                     2017 1 10 4
                     2017 2 9 3
                     2017 3 11 4
                     2017 4 25 15
                     2018 1 14 3
                     2018 2 12 5
                     2018 3 13 3
                     2018 4 15 4",
                     header = TRUE,sep = "")  
head(flight_data)

A data.frame: 6 × 4
	Year	Quarter	Flights	Delays
	<int>	<int>	<int>	<int>
1	2016	1	11	6
2	2016	2	12	5
3	2016	3	13	3
4	2016	4	12	5
5	2017	1	10	4
6	2017	2	9	3

In [86]:

#install.packages("dplyr")

In [87]:

library(dplyr)
result <- flight_data %>% 
   filter(Delays > 5) %>% 
   select(Year, Quarter, Flights)
head(result)

Attaching package: ‘dplyr’


The following objects are masked from ‘package:stats’:

    filter, lag


The following objects are masked from ‘package:base’:

    intersect, setdiff, setequal, union

A data.frame: 2 × 3
	Year	Quarter	Flights
	<int>	<int>	<int>
1	2016	1	11
2	2017	4	25

In [88]:

library(dplyr)
result <- flight_data %>% 
   group_by(Year) %>% 
   summarize(mean = mean(Flights), sd = sd(Flights))
head(result)

A tibble: 3 × 3
Year	mean	sd
<int>	<dbl>	<dbl>
2016	12.00	0.8164966
2017	13.75	7.5443135
2018	13.50	1.2909944

In [89]:

nrow(flight_data)
head(flight_data)

12

A data.frame: 6 × 4
	Year	Quarter	Flights	Delays
	<int>	<int>	<int>	<int>
1	2016	1	11	6
2	2016	2	12	5
3	2016	3	13	3
4	2016	4	12	5
5	2017	1	10	4
6	2017	2	9	3

In [90]:

#install.packages(reshape)
library(reshape)
result <- melt(flight_data[,c('Year', 'Quarter', 'Flights', 'Delays')], 
             id.vars = c(1,2))
nrow(result)
head(result[c(1:3,17:19), ])

Attaching package: ‘reshape’


The following object is masked from ‘package:dplyr’:

    rename

24

A data.frame: 6 × 4
	Year	Quarter	variable	value
	<int>	<int>	<fct>	<int>
1	2016	1	Flights	11
2	2016	2	Flights	12
3	2016	3	Flights	13
17	2017	1	Delays	4
18	2017	2	Delays	3
19	2017	3	Delays	4

merge¶

The function $merge$ can be used to join data frames. It can be used to produce inner, left, right, and outer joins.

In [91]:

stores <- data.frame(
  city = c("Rio de Janeiro", "Sao Paulo", "Paris", "New York", "Tokyo"),
  value = c(10, 12, 20, 25, 18))
head(stores)

A data.frame: 5 × 2
	city	value
	<chr>	<dbl>
1	Rio de Janeiro	10
2	Sao Paulo	12
3	Paris	20
4	New York	25
5	Tokyo	18

In [92]:

divisions <- data.frame(
  city = c("Rio de Janeiro", "Sao Paulo", "Paris", "New York", "Tokyo"),
  country = c("Brazil", "Brazil", "France", "US", "Japan"))
head(divisions)

A data.frame: 5 × 2
	city	country
	<chr>	<chr>
1	Rio de Janeiro	Brazil
2	Sao Paulo	Brazil
3	Paris	France
4	New York	US
5	Tokyo	Japan

In [93]:

stdiv <- merge(stores, divisions, by.x="city", by.y="city")
head(stdiv)

A data.frame: 5 × 3
	city	value	country
	<chr>	<dbl>	<chr>
1	New York	25	US
2	Paris	20	France
3	Rio de Janeiro	10	Brazil
4	Sao Paulo	12	Brazil
5	Tokyo	18	Japan

In [94]:

result <- stdiv %>% group_by(country) %>% 
   summarize(count = n(), amount = sum(value))
head(result)

A tibble: 4 × 3
country	count	amount
<chr>	<int>	<dbl>
Brazil	2	22
France	1	20
Japan	1	18
US	1	25

statistical tests: t-test¶

There are many statistical tests in R. One of the most used is the t-test. It checks if the mean of observations is not different from a theoretical value.

In [95]:

weight <- c(60, 72, 57, 90, 95, 72) 
height <- c(1.75, 1.80, 1.65, 1.90, 1.74, 1.91)
bmi <- weight/height^2

In [96]:

t.test(bmi, mu=22.5)

	One Sample t-test

data:  bmi
t = 0.34488, df = 5, p-value = 0.7442
alternative hypothesis: true mean is not equal to 22.5
95 percent confidence interval:
 18.41734 27.84791
sample estimates:
mean of x 
 23.13262

In [ ]: