Programming in R – Zhuo Yao, Ph.D.

Programming in R

Contents

Introduction

General Overview

One of the main attractions of using the R (http://cran.at.r-project.org) environment is the ease with which users can write their own programs and custom functions. The R programming syntax is extremely easy to learn, even for users with no previous programming experience. Once the basic R programming control structures are understood, users can use the R language as a powerful environment to perform complex custom analyses of almost any type of data.

Format of this Manual

In this manual all commands are given in code boxes, where the R code is printed in black, the comment text in blue and the output generated by R in green. All comments/explanations start with the standard comment sign ‘#’ to prevent them from being interpreted by R as commands. This way the content in the code boxes can be pasted with their comment text into the R console to evaluate their utility. Occasionally, several commands are printed on one line and separated by a semicolon ‘;’. Commands starting with a ‘$’ sign need to be executed from a Unix or Linux shell. Windows users can simply ignore them.

R Basics

The R & BioConductor manual provides a general introduction to the usage of the R environment and its basic command syntax.

Code Editors for R

Several excellent code editors are available that provide functionalities like R syntax highlighting, auto code indenting and utilities to send code/functions to the R console.

Basic code editors provided by Rguis
RStudio: GUI-based IDE for R
Vim-R-Tmux: R working environment based on vim and tmux
Emacs (ESS add-on package)
gedit and Rgedit
RKWard
Eclipse
Tinn-R
Notepad++ (NppToR)

Programming in R using Vim or Emacs Programming in R using RStudio

Integrating R with Vim and Tmux

Users interested in integrating R with vim and tmux may want to consult the Vim-R-Tmux configuration page.

Finding Help

Reference list on R programming (selection)

R Programming for Bioinformatics, by Robert Gentleman
S Programming, by W. N. Venables and B. D. Ripley
Programming with Data, by John M. Chambers
R Help & R Coding Conventions, Henrik Bengtsson, Lund University
Programming in R (Vincent Zoonekynd)
Peter’s R Programming Pages, University of Warwick
Rtips, Paul Johnsson, University of Kansas
R for Programmers, Norm Matloff, UC Davis
High-Performance R, Dirk Eddelbuettel tutorial presented at useR-2008
C/C++ level programming for R, Gopi Goswami

Control Structures

Conditional Executions

Comparison Operators

equal: ==
not equal: !=
greater/less than: > <
greater/less than or equal: >= <=

Logical Operators

and: &
or: |
not: !

If Statements

If statements operate on length-one logical vectors.

Syntax

if(cond1=true) { cmd1 } else { cmd2 }

Example

if(1==0) {

print(1)

} else {

print(2)

}

[1] 2

Introduction

R Basics

Code Editors for R

Integrating R with Vim and Tmux

Finding Help

Control Structures

Conditional Executions

Comparison Operators

Logical Operators

If Statements

Ifelse Statements

Loops

For Loop

While Loop

Apply Loop Family

For Two-Dimensional Data Sets: apply

For Ragged Arrays: tapply

For Vectors and Lists: lapply and sapply

Other Loops

Improving Speed Performance of Loops

Functions

Useful Utilities

Debugging Utilities

Regular Expressions

Interpreting Character String as Expression

Time, Date and Sleep

Calling External Software with System Command

Miscellaneous Utilities

Running R Programs

Object-Oriented Programming (OOP)

Define S4 Classes

Assign Generics and Methods

Building R Packages

Reproducible Research by Integrating R with Latex

R Programming Exercises

Exercise Slides

Sample Scripts

Batch Operations on Many Files

Large-scale Array Analysis

Graphical Procedures: Feature Map Example

Sequence Analysis Utilities

Pattern Matching and Positional Parsing of Sequences

Identify Over-Represented Strings in Sequence Sets

Translate DNA into Protein

Subsetting of Structure Definition Files (SDF)

Managing Latex BibTeX Databases

Loan Payments and Amortization Tables

Course Assignment: GC Content, Reverse & Complement