Beginning Tutorials

Section Chairs:

Gerri Furlow

North Carolina State University

Raleigh, NC

Ron Fehd

Centers for Disease Control and Prevention (CDC)

Atlanta, GA

 

 

   
   

SAS/ASSIST Software for Beginners

   

Arturo Barrios and Sally Muller, University of North Carolina, Chapel Hill, NC

   

Four years ago we began teaching a two-hour introduction to SAS/ASSIST. Most of the hundreds of students have been SAS novices with limited computing knowledge of any sort. SAS/ASSIST has always provided a simple and non-threatening, yet very capable, tool for using SAS and learning the language.

   

Arturo Barrios is a Computer Consultant for the Customer Services Division of Academic Technology and Networks (ATN) at the University of North Carolina. Arturo began using SAS as a graduate student in the Dept. of City and Regional Planning, in 1991. He progressed to supporting SAS as a Computer Consultant and teaching SAS Short Courses, including SAS/ASSIST, SAS/CONNECT, and soon "Putting a SAS Application on the Web".

Sally Muller is the Manager of the Research Support Group in the Customer Services Division of Academic Technology and Networks (ATN) at the University of North Carolina. Sally began using SAS software as a programmer in 1978 and became a site representative in 1987. Since then she believes she has answered over 10 million of SAS user questions and taught several thousand introductory SAS classes, but admits that these numbers may be **slightly** inflated. Sally found that in teaching these students, who were often completely new to the computer, that SAS/ASSIST could provide a simple, nonthreatening, and yet very capable, tool for using the SAS System. A desire to share this revelation with other teachers of novice SAS users, was the inspiration of this paper.

 

 

   

Developing an Executive Information System with SAS/EIS Software

   

Matthew Becker, STATPROBE, Inc., Ann Arbor, MI

   

SAS has provided users with a new, intuitive, and powerful tool for developing information systems. SAS/EIS software provides users with a vast set of objects to aid in the development of custom information applications. In this tutorial, a simple bank account application will be developed. The following tasks will be stepped through:

  1. Details for a bank data set
  2. Creating the data set structure in SAS
  3. Registering of the data set in the application's metabase
  4. Building the application

  • Data entry system
  • Expanding report
  • Hierarchical organization chart
  • Multi-dimensional report
  • Graphical menu builder

The tutorial is geared towards users with little or no SAS/EIS experience.

   

Matt Becker received his B.S. (Magna Cum Laude) from Western Michigan University in Computer Science - Theory and Analysis. His professional career began as an Applications Programmer employed at STATPROBE, Inc. in 1990. He currently works as the Manager of Research Information Systems at STATPROBE, Inc., where he has been programming in SAS for 7 years. He uses SAS version 6.12 running on a Pentium PC with the Windows 95 operating system. He routinely works with base SAS and SAS/AF. His interests include object-oriented reporting systems and SAS/AF constructed user interfaces.

   

Introduction to SAS Arrays

   

Ken Currier, Norfolk Southern Corporation

   

Take another intriguing and sometimes mystifying tour of Humongus, Inc. and learn of the different ways that SAS arrays can assist you in your day-to-day programming requirements. This paper will present various programming situations in which SAS arrays can be used to streamline and automate work processes.

   

Ken Currier, Norfolk Southern Corporation, has over 19 years of experience in systems audit, accounting, costing, and tax areas. He is a SAS user of over 12 years. He has used SAS to design and develop numerous data extraction, analytical, bridging and streamlining systems for various corporate legacy systems as well as the GEAC Host Systems general ledger, project tracking, and fixed asset systems.

   

SQL: A Sequel to the DATA Step

   

Heidi Markovitz, Simply Systems, Miami, FL

   

This tutorial introduces the SAS implementation of the SQL language. It explains the basic structures and functions used in PROC SQL. Attendees will be shown how to use SQL to extract data from SAS data sets, report on that data, and build new tables and views from extracted data. For listeners in a data mode, plenty of attention will be given the Select statement and the various forms of Union (or Join). Examples will compare the SQL way of data processing with other SAS techniques, including the DATA step and PROC SORT. Use of the SQL window may also be demonstrated (given time).

   

Heidi Markovitz operates Simply Systems, a consulting practice in Miami, Florida. She specializes in custom development of data entry and display systems, using SAS AF/Frame, FSP and other modules. Heidi is a graduate of the Wharton School and has been SASsy for 12 years. She is a SAS Quality Partner.

 

 

   

Phonetic Matching of Character Data

   

S. David Riba, JADE Tech, Inc., Clearwater, FL

   

There are many occasions when character values must be compared on non-precise, or "fuzzy", criteria. For example, how do you find all the values in your data for a city such as Raleigh, NC. when the values could be spelled (or mis-spelled) as:

 

Raleigh, Raliegh, Ralehg, or even Roleg

 

Traditionally, SAS programming looks at the TEXT of the character values and attempts to define all of the possible text variations. There is another way. Most people, when they do not know how to spell something, will phoneticize it and spell it based on how it sounds. You can compare text the same way, based on how the text value SOUNDS -- phonetically. There are several tools in Base SAS to compare character data based on phonetic values. This paper will review these SAS tools, and present several techniques for comparing and summarizing data based on phonetic, instead of text, values. These techniques can be useful for all levels of SAS programming expertise.

   

S. David Riba is CEO of JADE Tech, Inc., a SAS Institute Quality Partner who specializes entirely in applications development, consulting, and training in the SAS System. He is the founder and President of the Florida Gulf Coast SAS Users Group. He chartered and served as Co-Chair of the first SouthEast SAS Users Group conference, SESUG '93, and serves as President of the Executive Council of the SouthEast SAS Users Group. His first SUGI was in 1983, and he has been actively involved in both SUGI and the Regional SAS Users Group community since then. He has presented papers and assisted in various capacities at SUGI, SESUG, NESUG, MWSUG, SCSUG, and PharmaSUG. Dave is an unrepentant SAS bigot. His major areas of interest are efficient programming techniques and applications development using the SAS System, particularly using Screen Control Language with SAS/AF and FRAME technology.

   

Statistical Presentations using the Power of PROC TABULATE

   

C. Olivia Rud, Advanta Corp.

   

PROC TABULATE is a versatile tool which can combine the functionality of PROC MEANS and PROC FREQ. Additional power exists in its ability to perform analysis on grouped data. This paper details the steps for constructing a table of marketing statistics using a variety of features. Missing values and formats are used to control the table calculations. Labeling and selected options provide finishing touches for controlling the table appearance.

   

C. Olivia Rud is Manager of Pricing Optimization at Advanta Corporation. She has a Masters in Decision Science with an emphasis in Statistics from Arizona State Universty. She has been a SAS user for nine years..

   

Learning About Your Data: Tips and Techniques for Looking at Large Files

   

Sandra Schlotzhauer, Schlotzhauer Consulting, and Bob Anschuetz, Quintiles, Inc., Research Triangle Park, NC

   

As a SAS consultant, you often receive unfamiliar data. A first step in analyzing or reporting on the data is to familiarize yourself with the data. For small files, or only a few files, simply printing the data may suffice. As the number of files and size of each file increases, this becomes increasingly less effective. We cover several techniques to investigate large files including: checking the structure of the data sets, finding duplicate variables, determining if the data values are reasonable, finding holes (unexpectedly large numbers or missing or identical values), using formats to handle long character values, using indicator variables instead of numeric variables, and more. We also cover general strategies, such as documenting the data sets as received, documenting changes, using subsets of the data for testing, and keeping the original data sets intact. Many of these tools are useful even for cases where you cannot change the original data, since the results give you a better understanding of the data. Examples are primarily from the pharmaceutical industry, but the techniques can be useful for other types of data as well.

   

Sandra Schlotzhauer provides training and consulting services with specialization in the pharmaceutical industry. Training focuses on JMP, SAS/INSIGHT, SAS/QC, and SAS/PH-Clinical. She started Schlotzhauer Consulting after nine years at SAS Institute; prior to that she worked as a consulting statistician in industry. Among other projects, she is currently updating the "SAS System for Elementary Statistical Analysis" with Dr. Ray Littell. Sandra has been a SAS user since 1979.

Bob Anschuetz is a Senior Statistical Programmer Analyst at Quintiles, Inc. With ten years of SAS experience and a background in Statistics, his current areas of concentration include: Integrated Database Design, NDAs, and Electronic Submissions. Prior to joining Quintiles, Bob worked at several major pharmaceutical companies in the North East United States.

   

Some Efficiency Tips for Beginning SAS Programmers

   

Debbie Skinner, John Alden Life Insurance Co., Sacramento, CA

   

There are a number of simple efficiency techniques that should be kept in mind whenever you are constructing a SAS program. These basic practices will almost always result in programs that run faster and use less computer resources. Beginning programmers are often relieved to just get programs to execute successfully, without being mindful of speed or resources. But in many organizations, computer resources are a scarce commodity. As a result, inefficient code is - at the very least - frowned upon, if not directly penalized in the form of unnecessarily high chargebacks. This paper presents a few of the simple efficiency techniques that all programmers, especially beginners, should consider when constructing SAS programs.

   

Debbie Skinner has been using SAS for data analysis for 13 years. She has worked for John Alden Life Insurance Company since 1983 and is currently the Director of Utilization Analysis in the Medical Management department. Debbie is active in the SouthEast SAS Users Group, having co-chaired the 1994 SESUG Conference and serving on the SESUG Executive Council.

   

Introduction to Data Warehousing: From Source Data to the Warehouse

   

Chris Toppe, AT&T Solutions, Rockville, MD

   

A data warehouse can be defined as a stable and predictable environment which enables decision making. But just exactly does that mean? What really is a data warehouse? And how is one created? This talk covers the basics of data warehousing and covers the major issues that must be addressed in creating one. This is a talk for novices in this rapidly growing area, designed to give an appreciation of the scope of a warehouse project.

   

Chris Toppe, Ph.D., is a manager with AT&T Solutions, the management consulting and systems integration unit of AT&T. He is a frequent speaker on data warehousing issues, with a particular focus on how users interact with the system. He has been a SAS user for 20 years and has served as section chair at SUGI and SESUG, and has spoken at most of the regional user groups. He is past president of DCSUG.

   

The Right Approach to Learning PROC TABULATE

   

Bob Virgile, Robert Virgile Associates, Inc., Woburn, MA

   

PROC TABULATE lets you create tables with statistics in the cells (instead of frequency counts), and with combinations of variables in each dimension of the table. On the other hand, very few procedures require their own 300-page manual. This tutorial presents a way to learn the procedure quickly, a method of getting started without having to study every available statement and option.

   

Bob Virgile is an independent SAS consultant. He has written numerous papers, books, and problem solving contests. He also develops and teaches an exceptional series of SAS classes

   

Exploring Data with Base SAS Software

   

Tom Winn, Texas State Comptroller's Office, Austin, TX

   

Statistical methods are tools which are used to summarize and analyze data. Exploratory data analysis is the application of graphical and statistical techniques to discover the structure of data. The goal of exploratory data analysis is to characterize the data and to reveal fundamental relationships among them. It is quick, dynamic, and highly interactive. Furthermore, exploratory data analysis is not just for use by professional statisticians -- the methods also are used by scientists, engineers, and many other types of researchers. This paper explains how to produce and to interpret scatter plots, histograms, stem-and-leaf plots, box-and-whisker plots, and various descriptive statistics, using Base SAS software. This paper should be accessible to people having experience with basic DATA and procedure programming, but no prior knowledge of statistics is required.

   

The author, Thomas J. Winn, Jr. is a mathematician and economist with the Texas State Comptroller's Office, in Austin, Texas. His specialties include mathematical modeling, econometrics, and tax policy research. Tom has used the SAS System for building econometric models, forecasting applications, statistical analysis, control charts, and statistically-based scoring systems for selection purposes. Tom is a co-founder of the South-Central SAS Users' Group, and he has been a SAS user for more than fourteen years.