[From the U.S. Government Printing Office, www.gpo.gov]
ro @), 10 INVESTIGATION INTO THE FEASIBILITY OF ESTABLISHING A STATEWIDE GEOGRAPHIC INFORMATION SYSTEM November 1979 @-j MERS M 2- BB :"o M 1/6\ F 0 5 p MEBE M2F oT MZFL M2 RS G FO V p COASTAL ZONE INFORMATION CENTER. HD 211 M2 M14 1979 urces Program purce Planning Division --.2 Planning Office INVESTIGATION INTO THE FEASIBILITY-OF ESTABLISHING A STATEWIDE, NATURAL RESOURCES, GEOGRAPHIC INFORMATION SYSTEM FOR MAINE (FINAL REPORT) OCTOBER, 1979 Maine State Planning Office and Ell, Maine Land and Water Resources Council Acknowledgements: The principal author was CHARLES YUILL., a Massachusetts Audubon Society environmental intern working with the State Planning Office. MAL CAREY of the Social Sciences Research Institute, University of Maine, prepared Section V; JOSEPH CHAISSON of the State Planning Office directed this study, prepared portions of the report, and provided final editing. Many thanks are given as well to other state and regional agency staff members who patiently assisted this project through a series of interviews and by providing review comments. Financial assistance for the preparation of this report was provided by the Department of Housing and Urban Development under their Comprehensive Planning Assistance (Section 701) Program; the U. S. Water Resources Council, and the Massachusetts Audubon Society Environmental Intern Program. TABLE OF CONTENTS I. RECOMMENDATIONS AND EXECUTIVE SUMMARY ... ........... 3 11. INTRODUCTION ................. ......................................... 8 Ill. BASICS OF GEOGRAPHIC INFORMATION SYSTEMS .... o...... o........... oo .... 11 IV. ELEMENTS OF SPATIAL DATA SYSTEM DESIGN .................... o....... o- 76 V. COMPUTER HARDWARE AND SOFTWARE AVAILABILITY.. ........ o ....... oo ... o... 13E3 V1. DETERMINING SPECIFICATIONS FOR A GEOGRAPHIC INFORMATION SYSTEM . ................. o.... o................ o .................... o .... o. APPENDICES: A. Glossaryof Technical Terms Bo A Review and Evaluation of Selected Current and Developing State Geographic Information Systems C. A Proposed Natural Resources Information Planning System for Maine D. References 2 SECTION I RECOMMENDATIONS AND EXECUTIVE SUMMARY A RECOMMENDATIONS: 1. Because of the current level of costs associated with establishing a statewide GIS, the anticipated steady decline in computer hardware costs, and the federal goverment commitment to establish a number of GIS's with instate applications, MAINE SHOULD NOT PROCEED, AT THE-PRESENT TIME TO ESTABLISH A STATEWIDE GIS, HOWEVER, DEVELOPMENT AND EVENTUAL ESTABLISHMENT OF AN EFFECTIVE ... STATEWIDE IS,THAT WOULD INCLUDE CAPABILITES ADEQUATE TO MEET REGIONAL PLANNING AND RESOURCE MANAGEMENT NEEDS AND THAT WOULD BE COMPATIBLE WITH EMERGING CADASTRAL LAND RECORD SYSTEMS SHOULD BE CONSIDERED A LONG TERM GOAL OF THE STATE, 2. A DATA COORDINATOR SHOULD BE DESIGNATED WITHIN THE STATE PLANNING OFFICE INCLUDED WITHIN THIS COOR0DNATORS RESPONSIBILITIES WOULD BE: A. PROVISION OF STAFF SUPPORT AS NECESSARY T0 THE DATA MANAGEMENT SUBCOMMITTEE OF THE LAND AND WATER RESOURCES COUNCIL; B. PURSUING IMPLEMENTATION., EITHER DIRECTLY OR AS A PRO- JECT SUPERVISOR OF THE GIS RECOMMENDATIONS THAT FOLLOW, 3. USER NEEDS ASSESSMENTS FOR DATA STORAGE, RETR1EVAL,AND ANALYSIS SHOULD BE UNDERTAKEN ON A REGULAR (BI-ANNUAL) BASIS.THESE ASSESSMENTS SHOULD INCLUDE USERS AT BOTH STATE AND REGIONAL LEVELS AND SHOULD EMPHASIZE DATA NEEDED TO ADDRESS PRIORITY RESOURCE MANAGEMENT PROBLEMS, 4. THE STATE NATURAL RESOURCE DATA COLLECTION PLAN SHOULD BE COMPLETED AND USED AS THE BASIS FOR AN ACTION PLAN TO PROVIDE THE COMPLETE DATA BASE NECESSARY FOR AN EFFECTIVE STATEWIDE GIS. 5. A PROCESS SHOULD BE INITIATED TO CONTINUE THE DESIGN OF A G1S SPECIFICALLY INCLUDING THE DEVELOPMENT.OF METHODOLOGIES FOR THE ANALYSIS OF MAINE NATURAL RESOURCES DATA.THAT WOULD BE CONTAINED WITHIN A STATEWIDE. GEOGRAPHIC SYSTEM. 6. ADVANCES IN SOFTWARE FOR GIS APPLICATIONS AND THE EVOLUTION OF CAPABILITIES AND COSTS OF COMPUTER HARDWARE SUITABLE FOR GIS APPLICATIONS SHOULD BE REGULARLY MONITORED. 7. PROGRESS MADE BY FEDERAL AGENCIES TOWARDS DEVELOPMENT OF AN AUTOMATED DATA BASE FOR MAINE AND INFORMATION SYSTEMS THAT USE SUCH DATA SHOULDBE REVIEWED ON AN ANNUAL BASIS, 8. THAT ALL CONTINUATION OF WORK TOWARDS THE DEVELOPMENT AND EVENTUAL ESTABLISHMENT OF A STATEWIDE GIS BE CONDUCTED UNDER POLICY GUIDANCE OF THE LAND AND WATER RESOURCES COUNCIL. 9. If A STATE OR REGIONAL PLANNING AGENCY. INTENDS TO ESTABLISH. AN. AUTOMATED,GIS FOR EITHER GENERAL USE OR FOR SPECIFIC PROJECT WORK THE AGENCY SHOULD SUBMIT A DETAILS) PROJECT DESCRIPTION TO THE DATA MANAGEMENT SUBCOMITEE OF THE LAND AND WATER RESOURCES COUNCIL FOR REVIEW THE SUBCOMITTEE SHALL REVIEW SUCH PLANS AND MAKE COMMENT 3 ON THE RELATIONSHIP-OF THE SPECIF-IC PROPOSAL TO ONGOING PLANS FOR ESTABLISHING-A STATEWIDE GIS.. THE OBJECTIVE OF THIS REVIEW PROCESS IS TO ACHIEVE THE MAXIMUM CONSISTENCY WITH AN EVENTUAL STATE SYSTEM THAT IS POSSIBLE WITHIN SPECIFIC STATE OR REGIONAL AGENCY PROGRAM REQUIREMENTS AND TO AVOID UNNECESSARY DUPLICATION OF EFFORT. SUMMARY The specific purpose of this study has been to investigate the feasibility of establishing a statewide geographic information system for the storage, retrieval, and analysis of spatially distributed natural resources data. The work outlined in this report is a preliminary investigation of Geographic Information Systems,(GIS) and their relationship to land planning and management in Maine; it contains a series of findings and recommendations adopted by the L&WRC concerning state GIS development. As envisioned in this report the goal of a "Maine Natural Resource Information System" is to provide the best possible vehicle for agencies to fulfill their statutory requirements and satisfy needs in terms of resource planning, development, management, and the conservation of Maine's natural resources. Natural resource related information is collected by various state agencies in accord with their separate legis- lative mandates. However, agency specialization alone does not identify completely the type of information that is being accumulated or what is available to other agencies. At the present time specialized data systems are functioning at the regional and state level in Maine, but the mechanism does not exist to link, in common network, the sum of sources and users of natural resource data/information. It is important that the data/information needs of the agencies involved be met in the most cost-effective andefficient way possible. A geographic information system can accomplish this.task by functioning as an integrating force, not by interfering with primary data gathering activities, but by properly tapping existing and potential sources, services, funding and systems in a way that will effectively unite the user with needed data. In Maine the value of such a system by necessity would have to be judged in terms of addressing: Consistent format, quality, standards andscales of data Improved indexing and storage of existing information Improved coordination of ongoing planning programs Less duplication in data storage Elimination of conflicting data Eventual hardware and software compatibility Such a system need not be totally a system in the strict engineering sense. It should merely be a service mechanism for 1) assembling data in both machine readible and non-machine formats; 2) processing raw data into meaningful information; 3) adjusting and organizing data into formats and forms suitable for modern storage, retrieval, and manipulation procedures; 4) storage of data in a systematic information base; and 5) displaying this data in graphics, models, maps, studies, and simulations appropriate to specific audiences and problems. Basic to the implementation of 4 successful system are three critical elements: 1) knowledgeable users of the system; 2) the requisite data base; and 3) operational system,capabilities determined by the needs of the users. These elements are the focus of this study- 4 STUDY FINDINGS 1. The establishment of a statewide GIS would benefit Maine by ..a) Improving the availability of information to support state resource management and decision making activities; b) Reducing the costs of analysis and application of natural resource information compared with current, decentralized manual anaiysis techniques; and c) Providing.information useful in regional (substate) planning and resource management activities. 2. Natural resource information is currently collected and used by many state agencies in carrying out legislative mandates. A wide range of "data systems" exist within state agencies that use natural resources data to support resource planning, management, and decision making activities. These "systems" range from manual use of secondary data, to age'ncies that rely upon in-house, primary data collection with associated automated data analysis. 3. Important data series that would be used by a statewide GIS (soils, surficial geology, land cover, etc.), are in many cases incomplete, Data that exists is scattered through 'inaccurate, or out of date. files,libraries, and information systems offederal, state, and private organizations. Therefore, the full potential of a statewide GIS would not be achieved until a complete, accurate, and timely natural resource data base exists. 4. The federal government plans to establish an extensive series of digital format,,natural resource data files for Maine, and also has developed or is developing a number of GIS's for the storage, manipu- lation, and retrieval of such data. Many of these systems could be adapted for use at,the state level. It is likely that a general federal GIS for natural resources data will evolve, combining major federally established natural resource digital data bases that would be available for use by the state. 5. The Department of Conservation, Bureau,of.Forestry, is in the pro- cess of establishing in-state use of GIS's operated by the U. S. Forest Service, U. S. Department of Agriculture., This involvement is funded under the Renewable Resources Planning Act (RPA), and has includ'ed hiring a full-time professional staff person to undertake the necessary programming to support-such involvement. 6. Computer hardware Suitable for GIS applications is rapidly evolving. The initial and operating costs of such hardware are steadily decreasing at. the present time. The technology of performing input and output of geographic data is changing very rapidly with higher quality choices and possibly lower costs expected.. A proliferation of excellent compute software applicable to geographic information systems currently exists in stages of application, design, and development. Available software usable for establishing a statewide GIS ranges from that available For free within the public sector to expensive private sector products. Soft- ware for GIS applications is evolving at a very rapid rate. 7. A satewide GIS could be established at the present time in the following ways: a) In-house system using existing Central Computer Services hardware; b) In-house system using a new minicomputer that would be "designed" for and dedicated to GIS use. c) Contract with University of Maine d) Private contract. 8. Staffing costs associated With any of the four alternatives would be approxi- mately a, minimum of $60,000/year, 9. The optimum approach to establishing a state system at the present time would be through an in-house, dedicated minicomputer system. The hardware can- figuration of such a system would consist of: a) Dedicated Central Processing Unit (CPU). A stand-alone mini or micro computer system-with large address space and substantial computational capability will be required to meet the demands created by . A GIS development and production activities stand-alone system will offer many advantages over shared use of a larger computer facility: radically cheaper cycles, better thruput, control of hours of operation and operating priorities, control of tape and disk mounting and printout dispatching, and a much simpler systems environment. 6 b) Tape Subsystem, An industry-standard 1600 6p. tape subsystem will be required for disk back-up and for large-file communication and archiving. c) Disk Subsystem, A disk system (Possibly using fixed, Winchester drives) is required to provide on-line and working storage of data and programs, d) Line Printer. Amedium-speed line printer is required to provide reasonab I e printing turn-around. It is strongly suggested that the unit also have dot matrix plotting capability for displaying pixel-oriented data. e) Digitizer. A large-bed (42" x 60") digitizer is recommended to allow con- venient digitizing of even, large format maps without requiring segmentation, Interface to the system should be seriial for simpli- city and flexibility. f) Graphic Display Tube. A high-resolution graphic terminal (either storage tube or refresh) is required for quick and inexpensive inspection of geographical data. The interface should be serial for most choices of CPU and graphic terminal to allow future system reconfiguration and en- hancement with minimal hassle. g) Plotter; A 30-36 inch drum plotter will offer the best performance/price ratio in the early years of a state GIS activity. 10. Data processing hardware that would be used to establish the recommended state system configuration is improving so rapidly that cost esti,mates are difficult to prepare and would have little meaning. Annual hardware and software costs would probably be less than the minimum annual staff costs. 7 SECTION 11 INTRODUCTION Interagency cooperation in natural resource planning and management in Maine is encouraged through the efforts of the Maine Land and Water Resources Council (L&WRC). The Council consists of representatives of state agencies with natural resource management and planning-related responsibilities, the University of Maine, Regional Planning Commissions, and the Legislature. In June of 1976, William Adams, then Commissioner of the Department of Environmental Protection and Council Chairman, requested that a study be done for the Council to determine what Maine's approach should be towards establishing a statewide, automated, geographic information system for natural resources data. Commissioner Adams made this request, in part, because of concerns about the rather large investments some Regional Planning Commissions were making in such systems as part of their 208 programs. Responsibility for advising the State Planning Office in conducting the study was assigned to the to the Council Data Management Subcommittee. Funding and staff resources to conduct the study were not obtained unti I June, 1977. A draft of the study report was widely circulated for review during the spring and summer of 1978. Based upon review comments received, revisions were made to the study report and a set of findings and recommenda- tions for state action were prepared for the Data Management Subcommittee to review, The Subcommittee met in January 1979'to discuss the draft findings and recommendations. Numerous changes to 4e recommendations were proposed, agreed upon, and forwarded to the Council for adoption. The Council met on October 25, 1979 and voted to adopt recommendations concerning establishing a statewide GIS as presented in this report. The purposes of this study are to: I . Determine to what extent a need exists for a state-level geographic information system (GIS) for natural resources data; 2. Inventory and describe federal activities directly related to the establishment of a state level GIS; 3. Review alternative methods of establishing a state-level GIS; and 4. To recommend what specific actions should be taken by the Land and Water Resources Council concerning the establishment of a state- level GIS. To accomplish these purposes, the study examines.the history, structure, design, and current utility of GIS's and also describes a framework for examining information o requirements for -land and water resources planning and management in Maine. The examination of information requirements addresses operations associated with data collection, handling, storage, retrieval, analysis, and display. This study does not recommend either the selection of a specific GIS or the setting of specifications for a specific GIS. Rather, a process is proposed that would work systematically over fime to establish an appropriate, well thought out state level GIS. it is clearly recognized that numerous existing needs of resource planners and managers are currently being met with available statistically based data management 8 srstems and various nonautomated spatial data management systems; and we expect t is situation to continue. Consequently, it is recognized that, as they are developed, geographic information systems that contain such spatial natural resource data at land cover types or surficia I geologic materials, statistical systems containing summaries of such data types as deer ki H, and systems concerned with land parcel ownership and boundaries must act together with flexible boundaries established between them. The study is organized into six sections and a number of appendices. Section I is an executive summary of the entire study, including its findings and recommendations. Section 11 is this introduction, Section 111, "Basics of Geographic Information Systems" provides a brief review of the compon ents of spatial data handling systems,, the frameworks within which they are expected to operate, typical products of such systems, and a detailed discussion of the considerations that are important in designing spatial data handling systems. Section IV, "Elements of Spatial Data 5ystem Design, focuses upon laying the ground- work for a continuing long-term data and system availability monitoring program. It is divided into two subsections. The First, "Determination of System Specifications" presents a framework for a long-term work effort whose objective would be to assess the information needs, data handling requirements, and data manipulation capabilities of potential system users, This subsection also contains a preliminary survey of the system needs of various state resource planning and management agencies. The second sub- se ction, "Data Availability, ".reviews the current and potential availability of system data from federal sources, Section V, "Hardware and Software Avai labi Ii ty, " revi ews the current avai labi Ii ty of software and computer hardware for establishing a state level GIS; and it also pro- poses an ongoing process for monitoring future developments in these areas. 9 Section VI, Determining Specifications for a Geographic Information System, presents a hierarchy of potential system specifications based upon the information concerning data needs, data handling, data availability, and the state of the cut in hardware and software development that was presented in earlier sections. Appendix A is a glossary of technical terms used in this report. Appendix B presents a review and evaluation of current and developing GISs in other states. Appendix C presents a proposed overview description of an integrated natural resources information planning program for Maine; and Appendix D is a list of references cited in the report. 0 SECTION III BASICS OF GEOGRAPHIC INFORMATION SYSTEMS Introduction Definitions Some Functions of Geographic Information Systems in Planning Examples of Geographic Information System Outputs Spatial Information System Options The Structure of Geographic Information Systems Data Aquisition. Data Characteristics Data Framework Data Evaluation Data Most Often Included in Geographic Information Systems Software and Hardware Software (System Procedures) Hardware Technical Operations of the System Introduction Data Input Functions Data Manipulation Procedures Data Analysis and Retrieval Data Output and Display Technical Procedures Summary Users of the System Management of the System Procurement of Hardware and Software Summary of Chapter I Basics of Geographic Information Systems 3 .1 Introduction 3 .1.1 Definitions: A geographic information system is a class of information systems which are defined by specifying the characteristics whi ch qualify information as being geographic. (Tomlinson, 1972). An information system is an ordered combination of data bases, resources (staff, equipment, etc.) and procedures designed to produce information useful to a decision making process. An information system therefor, is user demand driven, the user being defined as anyone responsible for responding to problems faced during the management of a resource. The relationship among the decision making proc ess, systems users and various elements of the information system are outlined in Figure I . 1 . The activities central to any information system can be described as being one of four categories oi- subsystems: 1) management of data base; 2) data processing;.3) data analysis and; 4) use of information (Fig. 1.2). *Figure 1.2 Basic Information Systems Functions System Management Data Data Informatio Processing Analysi s Use *Source: Tomlinson, 1972 12 EXTERNAL ACTIVITY SENSING DECISION MU!NG PROCESS INTERNAL DECISION QUESTION USER ANSWER NO DATA AVAILABLE USER NEEDSJ USER C ARGES ? YES ACCESS INFORMATION SYSTEM DESIGN ACCESS INTERFACE INTERFACE SYSTEM SUPPORT MANAGEMENT MATERIAL F FORMAL STAFF EQUIPMENT FORMAL REQUEST BUILD114G OPERATIONS DISPLAY MAINTENANCE _T INFORMATION DATA DATA RETRIEVAL a AOUISITION PROCE SSING SYSTEM MODELING INPUT DATA DATA BASE I INTERFACE STORAGE I BASE PERFORM N E ACQUISITION a EMENT SPECIFIC ICNS TESTING L DATA PROCESSING TECHNOLOGY BOUNDRY OF HARDWARE - SOFTWARE INFORMATION SYSTEM MANUAL- AUTOMATED Fi g. 1. Basi c Elements of a Geographic Information System DATA A C MANAG AT R N NG R CIE P 0 SS USER CHARGES@ YES INFOR N YRAL @A AG 0 M EQUE ST @DATA PS MARC MA N source:Tschanz and Kennedy, 1975. 13 Implicit in Figure 1.2 is the notion that data becomes information only after it has been retrieved and processed for particular purposes. The boundary condition which separates geographic information systems from other.types is the requirement that the data be referenced in a manner which will allow retrieval, analysis, and display based on spatial criteria (i.e., map display of soils information). Data description of various natural resource and socio-cultural aspects of the landscape generally have a describable spatial component. That is, data describing objects, entities or conditions incorporate identifiers that prescribe where they are. These data types can be directly related to places. They are linked to the places by locational identifiers such as coordinates of latitude and longitude, arbitrary grid cells, administrative areas, or merely by the position of one factor relative to all its adjacencies. When a set of techniques for handling data includes procedures for handling locational identifiers in concert with descriptive information, then such techniques are referred to as "spatial data handling systems. " The term "spatial data" is commonly, used as a synonym for "geographical data" and such techniques are referred to as "geographic data handling techniques." As previously reviewed, the concept of geographic or spatial systems is much broader than the concept of geographic data handling. The spatial information system must consist of the functions which include the observation and collection,of data through data analysis to theur use in some decision making process. The range of spatia I information system functions are shown below. Figure 1.3 Spatial Information Function Boundaries observations medSurerqent - desqripti explan- cision 4n fire -Fdec ation -J DATA DATA GATH ERI L G H A m r) i i Ni DATA ANALYSIS DECISI N 14 To.be of any utility, a geographic information system must be part of an institution, Its function is to gather data and process them for some use of the institution. Da*ta handling is a limited, though essential, part of the process. If the data being handled by the system are spatial data as defined before, and the techniques being utilized are spatial data handling fe chniques, then the system is referred to as a "geographic information system. 3.1.2. Some functions of geographic information systems in planning Generally, natural resource planning responds to both demand projections and the relative abundance or scarcity of natural resources. This enables an assessment of the relationship between demand satisfaction and actual resource utilization. This definition assumes that all physical characteristics associated with a given landscape (land, water, and plant and animal ecosystems) are potentially within the domain of natural resource planning. New concepts aimed at defining and evaluating landscape characteristics and land use activities more rigorously than in the past are now required in resource planning. Historically, land classification systems h,3ve combined concepts of land use activity, intensity, and several other factors including zoning, ownership and natural features. In the future, it appears that land use and landscape unit classification schemes define d specifically by activity, intensity and any number of hydrologic and ecologic parameters related to descriptions of impacts and not only appearance will be required. It will be critical to refine classification schemes to depict factors clearly and unambiguously as the focus of planning shifts from growth to a focus which better balances growth and environmental Conservation. Generally, natural resource planning is best designed for implementation at the large regional (i.e.., river basin), state or multi-state regional scale. Implicit in such natural resource planning are six basic activities: 1) inventory; 2) estimation of resource demand; 3) resource analysis; 4) plan 15 formulation; 5) plan implementation; and 6) plan monitoring. (Figure 1.4). Fig. 1.4 Resource Planning and Managrnent C R A L1 E S CK-M-- C"M DATA SOCIO-ECONOMI NATTLI ACQUISMON %i.,vrION, i= D:TF'C*,Il AVON ANALYSIS lr@rftv or DI:NL-N%lr) llnc.-Casinr! WEINIAND AND DLSF"Y AND HLSOURCZ r.ZSO;:!ZC=) ANALYSIS FORW%T, AIODEEIS :)-LS .-7 (PLAN 'U.7-ATION 11ENUAL MODEUS PLAN O"T"XI MONIT OR Source: U.S..G.S. 1975 1) Inventories cover the data required for estimating the demands and the data on natural resources which are identified to be of primary importance in the description of a particular region. 2) Inventoried demographics and economic conditions permits the assessment of internal and external demands forces which can be analyzed. This step also involves projecting various demographic and economic parameters for the given region. Alternatives range from no and slow growth to accellerated growth can be examined and detailed. j, T -e FO-t, '0 E IiF I L 16 3) Resource assessm ent is an estimation of capability, suitability, and carrying capacity of the landscape and its various resources. The carrying capacity of a region is reflective of the maxi-mum resource demand,that can be accomodated within acceptable limits. Such limits cannot be objectively defined but are derived from the goals and objectives of the population of the region. 4) Plan formulation involves balancing resource demand with resource availability within broad constraints set by the broad regional goals. The process of plan formulation is ideally designed to produce sets of alternatives or scenarios which are reviewed by appropriate decision makers,and are expressed as either physical plans or policy plans against which specific proposals are evaluated. 5) Evaluation of alternatives proceeds from alternative formulation and the selection of an alternative based on predetermined criteria is the require d output, 6) The final stage is plan monitoring. This process often involves new-data gathering, measurement and comparison of implementation achievements with previously stated goals and objectives A complete. plannilng process deals with the whole of'a region and total demands on resources. As well, specific functional planning is often required for dealing with issues suchas facilities siting or resources which are of unique signiflicance for the region. Such activities fall into two main categories: activities motivated by conservation or preservation Ci e. , critical areas); or ones motivated by the development or provision of special services (i.e., power plant siting). An additional data load results from these specific planning activities because of the necessity for greater -precision and resolution in data being.utilized to accurately depict selected geographic areas (Fig. 1.5). This general resource planning process is outlined only to serve as a reminder of the problems to -which a geographi'c information and data handling system will be expected to respond The amount of data implied by this process is s.a extensive that systematic techniques (manual and computerized) must be explored and implemented for data acquisition, storage, retrieval and utility. Generally, all alternatives to purely manual techniques involve the use of computers. 17 qw 'NO ow on) im, Va. lo Aw, 411111111111111 04, Mw OW 40) LOCAL LEVEL (Regulation Emphasis) information System Planning/Decis ion- Making Process Capabilities REGIONAL LEVEL (Area-Wide Planning Emphasis) Pla nning/Decis ion- Making Process Information System Capabilities STATE LEVEL (Policy Emphasis) Planning/Decis ion- Making Process Information System Capabilities Reappraisal Policy Formulation Inventory area measurement land indices Regulation Plan Development socio-economic - Overlay Locati / - - .. change detection Impact Assessment on/Allocation coincidence of physical features coincidence of physical and soclo-economic data - Models . descriptive . predictive Fig. 1.5 Hierarchy of Resource Planning Processes Relative to Information Systems source:nuerker 1975 As well, geographic information systems present powerful tools for dealing with issues associated with environmental impact assess ment. State wide systems have been used to generate data about fa cility sites, service areas and zones of impa ct. (See Table 1. 1) The use of an information system for impact analysis can help to determine the achievements of objectives of efficiency and equity for the assessment by providing easily accessible datato measure external effects, benefits and costs, both for general regions and for specific areas and population groups. The expected use and utility of data types determines how data must be collected and handled (Figure 1.6)., Formal data analysis models Ci.e., suitability for residential development mapping) are generally the required end use@of geographic resource information. However, even when formal models are the analysis method. the data sti I I must serve as imputs into what are called mental or informal models (the politically based decision making process). Data handling techniques must respond to the requirements of both formal models (accurate, reliabl e,-, timely, etc.) and informal' models (understandable, clear, concise data outputs). Figure 1.7 Information Use Decision Making (The Political Process) (informal Models) Natural Resource Geographic Information Planning (Formal System and Data Base Models) In this context, information systems must be multi-faceted, particularly if faced with responding to different appropriate levels of spatial and temporal scale. The information system must respond to the relationships between levels. of government, and the nature of functions such as area-wide planning, impact assessment and environmental regulations. 19 IMPACT TYPE LAND USE CHARACTERISTICS SOCIO-ECONOMIC DATA Spatial Incidence - Facility Site Amount of land taken, by type, for Number of persons, homes, facility businesses, etc., taken by facility - Service Area Amount of land, by type, in service Number of persons, households, area in service area - Impact Zone(s) Amount of land, by type, in zones Number of persons, households, impacted by facility in zones impacted by facility C) Achievement of Obiectives - Allocational Efficiency Measurement of externalities Aggregate benefits and costs - Distributional Equity Measurement of external effects Distribution of benefits and to specific properties costs to population groups Table 1.1 Use of Information System for Impact Assessment (Source:Duecker 1975) owl' on, low -owl lift), law Iwo, PLANNING/DECIS 10 N- MAKING PROCESS Information Location/ System Policy Plan Allocation Impact Elements Reappraisal Formulation Development (Site-Selection) Assessment Begulation_. Capabilities Inventory Change de- Indicators of Indicators of Population, Development Building tiaction achieving Implementa- public facil- applications permits objectives tioh ities, travel time - Overlay Change from Coincidence Coincidence landuse to of land fea- of land fea- I landuse tures tures - Models Land Indices Allocation Allocation, Location/al- Impact dif- efficiency, efficiency, location algo- fusion models, distribution distribution rithms, effi- e ff ic: ie ncy equity equity ciency, equity equity Spatial Scale Analysis Municipality State, re- State, re- Distance be- Service area, Ownership County, Re- gion, county, gion tween points Impact zone panel gion municipality of supply & demand Basic Unit Land Use Natural unit County Natural Eligible Natural unit Ownership (cell, poly- units sites panel gon) Soclo-Econ - rural Enumeration ED or County ED ED, MCD, ED District (ED) or,County - urban Census Block MCD ED or Tract Block, ED, ED Tract B lock Figure 1.6 Capability and Spatial Scale Requirements for Information System to Support a Statewide Planning/Decision Making Process (source.Duecker 1974) 3 .1.3. Examples of Geographic Information System Outputs Output products are usually defined as documents (either data listings or mapv) which can be used directly by the planning profession and/or the decision maker. This infers that no additional manipulation or analysis may be required before the information is utilized. For example, resource planning at the regional or state level is often concerned with the amount and distribution of vari-ous land uses. This type of information can be obtained from an information system in a variety of ways. A map is usually prepared from aerial photographs which outlines the boundaries of each use for which identification was required. The map contains both the boundaries of each use and a code which identifies the use. From mapped parcels, areas occupied before each use can be estimated. If more accurate information is required, areas can be measured by dot grid or planimeter, Such manual techniques often require several interations to ensure an acceptable level of accuracy. Another alternative is to code the mapped data for computer processing and direct the computer to m easure each area (see fi gures 1 .8. 1.9, and I . 10). It is also often desireable to note changes in land use over time. The some basic techniques are available to note changes in land use as to record present land use. One can obtain the photo- graphs, hand draw and then measure changes or such changes can be measured with the computer. Often the most useful products from the latter are tables showing changes which have occurred and specific maps depicting various kinds of land use change. Maps such as agricultural productivity and land surface cover, if overlaid, can produce comparisons of immediate interest to resource planners. Questions concerning land conversion, amounts of remaining agricultural land, the relationship of productivity to actual use can be easily, answered with graphic and tabular analysis pr ocedures. As well, other different map types can be combined manually or by computer to derive composite map results. For example, maps of salt water and topography can be overlaid to produce a map of non-fresh water coastal wetlands. Maps of physical capability, topoclimate and scenic resources can be overlaid to delineate areas most suitable 212 V il- i:;V:= r. r V z::; ::V.=:VV.. u:I-=:--:V; 4;="V1i7Lj;7-::@@!@.:7,""` -Im 1, MY1111! .4- :@: , V V. I;.VVI.-.ll-:N-,.j;, ........... ...... ....- N it ............. MV -SIV @V:%=SSVVV;..:V;;;.;.. V= :VW.V: ... .. .... .. n.--V :.3m.Vi-iVa@-:@ n, -I it'll - p r=V= -.VVV3;I!. I . ........ .IV I....... ........ ------VV :V%VV"nn .............. ... ......... IRl. SiLE; NIS V p. . . . . ... .... . .. JVI- V ` .:- Hiii. .4jF.,VVtV..V- ::Wltil; iS .1 `7 11M j, ................ ........... ... .......... ........... ......... ... .......... .......................... @:ii ::=:;=.: . ................ nii, ........... .. . .......... I .. . ........ 3 2 . .......... --7.. _;;,, .. !F--- : t. - . - i: ............ .. ...... .VIE. - iEE:i .......... .... . ....... . .. ...... ........... . ............................... ...... ISIS ....................... .1, ..... ................. ...... .... .. VIt H2 :1 -3 mlitull I ll;-I r. Slit ... .... .. ........ .. ....... .. .......... . ..... nwll-l- I n! ..... .. ... ... :3 =V- iq @Vll LV -.VN IV: Mu ::@tl ..... . ...... `14i,; ,:::-= .. . .. ..... . ... ..... .. V- iufrik!F@ .. ... ... V: ===@ -- -- - -Z=s ....... . . ...... I ::.,. VV-V-- V;: ! - -V:;;;V===V. U! .. ............. ... . .......... :V.V %=::V- =V@III V:iV iVI77p ... ....... H, .............. . ...... INS Xv.. ,I .......... . ........ ..... W It EV-- IS Oil ::=L=V==:.:: Vz:%. I0"iV IN -@E ... ... . . IW, -=V. I Of PH !11 .. . .......... V . .. ... --- Ni "it 6111111tiMi i .-:;IWI iAiil .1 1:v7- 11 111111- @:: :; .:. -.;: i 51 It, N! . ..... i'lin: @ -- 2 @- :F7 --7 VHV-:@.I i: ff7 F's TsI V I SIPV =Sim VMS . ..... 6. -:56 Hi . -1 -.; :, I` I =Z3 al Vq it U IN,---.,=-. :.- V . 411P1 .la :SIZ IN R."111i ;; I :V=VVa .9 IVV.... ....... dTfV- S. :%=-1 rilK it H jVS11S:;:; V. I =:F=j qr-V;V-- aw11-4q IS: II p- I pip . L is, 9VL 1-51 iii.;;4 "Ittilil ..t; V, ISM ilIT $!:I, xiji@illllli viiii,.niiii !:-:i 1@ Ti. ;1:.:;; V; rp" ji ilgii;,i jl@ - ,. .. i T 1:, IT'S @RVV V;. TI: V==1VV- Mill.!v!::!i@,Ti 1...S;.; ... - IV; IV _10" :IV TPI: `VVVI z 11li 1,-SM. II.I: ..:- --V; V ::t ;;V! SIR ::[::Ai@ -% V:1 iV! .4- mu Vil N ;V=VVt3V -:02 . . ........ RM IS Sit. =-g 2 a. ........ ;:=@Miannflll --V. IS V. N VS (R-x-FM@F7 =F:,iii MIHIII, i V 5: -V V. V_ .5- I! --.l; Vg .... ........- rza li-... -- N! -.6 R. .is "ill'" :MN-4.-IML ti: IV. -H!T! @P. TO ;,V. :I. t-z IVI g 7 L F Vii, P - L iV - :S VAli, Ulp qj!,:@ IIII.- =t EV! =2iw nV: V111 Q . "-.=N:lI .... - - V. IO.Ni R: Z z Mfl -,RV:;::V:V IS .. ...... IS FEj ... . ...... SSSSS%:VViS=VrV ..... .... . -:,ZEE . .... IV! V- V 2 . ..... . ...... V=r .......... E. Eta :7 a=V.- I Zz 77! ITS :V=.---- -"IS It- VI%1111 . ...... . ...... VVVV==:V- SM V;: -H ,:!:: V. :- =VVV1 U. IV. IV-. ..... .. T V V U IS V" .--VVVVISVl--VVV:, V ...'N'.. -7: -. VU .,V**.... .-V:;:VVVV=:=rV=- V= IS -VInn . . ......... IS -:V . . . . . . . ... ..... :=i- - Vu .. . ...... ..... :VVV . ... . ..... 1:zz; ............. V:; V.V: Lt.- . ....... ............... nt! -R- ..... ...... .................. :VV= ....... . ........... ... . ......... VV=.r==l=nVV: ------ - --- It VVV:Si9V==V::Vi=SVV VVnVSV=V ... . . ............ ::=--VV---=VVVV_-R ... ........... . =.@-%:;;V ;VV=.=::=VV VV:.VS.1V:::.V=:VV: V; nV r V-Z V:. -VS% VU: NVV V :2 V .-..:V =V ;VnVgVV .... . ... .. . ..... .. VViiiMV==- ....... :V VVnV:;VVVv=tV::;:= UIZZO ..V-qVVVW U=iSV ;VVV::=i=lV=: . ......... ... .... . ........ . .. ....... .VV:V. -VV: liiiVi V;:.: ZZV V 23- :t i . .........V....... . ......... :V:Vr::VVVV:VVVU: -V V.- -V V @iiz V1 ggl- i@.*"@@@:--..,@@-i-F.@..,-.-:--@@i-..'fV=I -:F- F Eiiiiiii Sio: . ..... IS Il ri I V- i;lz i4i. =.:F=, fl, SIM @,--,VVVVV=V=;. V:VV:zV%t V-V LV:---VV:=NI.V1, ......... . . ..Vn::%V=V.VV:V . . . . ..... ......... :V: VVSVH.-V==-:V :1:: VV!V_V UVIVMV ... . ....... .... =3 VV-aV_ il it Mm ............ ..... --V: ............... :;V . ........... -,-SzViSVVV:tVIV:V;: a :VVV :;zV I V .:;u IT . ....... .. ..... ....... ... . ... . . . . ........ .. ............. . . 7 .............. ......... ... ... .. .. ..... ... . V, iV; Figure 1.8 Example Geographic Information System Mapping (soil suitabi .lity for agriculture in Iowa) 23 source:Duecker 1975 -I z 6 -M Liar --- Figure l.--Base Map Figure 2.-!-Magnified portion of Huron--Ri-ver Basin- CST MOSS DO36 SID030 )`Otx6w1EEL COFIFORnTION 6siLVDISTONE PLAYING CONPANY 1C TV OF TPSILIWI WASTE WE -MG/L CL , K. LU%KININ I E-LRICK 51 NSIL11#61 KfChfGM 00 YV31LMATI MJChJGAH 0197 YPSILANTI Hlbomw *MA'j I RT, I I SILSIMTOIE PLAI 114@-TPS [LAST I 11PSILAIIII I'M MRY t-(X1UINt%1 7;1ON'AMUR CO. CO OISCHAt'GE 10 SARI FARVOR 1144 L. HIVIKIT =111MION JN W. SIXEI MIC Q4316 SILkiRSIONE P0111% Cmemy L-,.. , I- 0197 ,,ILA RIELL Caw. 7 MMICK 51. AL.",cm, Or SIVINTI (CL"XRFLF#41CEI YPSILANTI hic"115j" 14-CFE WRIER VEATMENT PLANT WIFILL 1wow, IN 5ILVFRICINC ?a L" T"ILANIT 1,1VIIALL TIC O""ATION 20.0 VICI)OLING WAIVI TO IOXIN AT MIETKAFEU SNtE 0 ,NI VIMNII W.W.T.P. RLL "FA 00 14 *OPYALL IWON"", I"" gh)@Xlvo Ext CN-TFSILFINTI Tw. INO OAYR) 170 " I @VAIII:71ERS ENS TE'WXCF,F1-DW-"w.80O`-S M ORTR) PH '30LIOSM.OIL-CRX 10.0 0. Figure 3.-Typical Station Data WATER RESOURCE DATA Figure 4.--Water Quality Degradation Figure 1.9 Fxample of Geographic Information System Output (source I GU 1972) lip M, :60) an, PCT POP. UNDER DIST r in 12.5 25.0 37.5 50.0 62.5 75. LA MC3 z;* 3 n (D X 0, Ln 0 ITI -0 C,) Lf) 0 M o LA (Q (D Q r) 0 57 cl c 10 for new housing development. Essentially, the typical kinds of questions faced by resource and land planners are concerned with the distribution, quantity, quality, and change in the characteristics of natural resources. It is of geographically referenced natural resource information which deals best with understanding these resource concerns with the provi sion of tabular and mapped information required for professional planners and decision makers alike. 31.1.4. Spatial Information System Options An agency faced with the task of building a data and information base for planning has the problem of creating inventories which describe the state of all relevant data elements. There are three basic approaches-, (1 ) the required data elements can be identified and programs developed to collect these data in a set of inventories; ar (2) data collected from other sources, usually for speciFic other purposes, may be acquired and combined in various ways to be useful in resource planning. and (3) a combination of (I ) and (2). Each of these approaches have significant advantages and disadvantages. In the first approach, the design and process of data collection operations are time consuming and very costly. The majoi advantage is the ability to control the definition of relevant data elements, and the manner in which they are collected. In the second approach, the time and costs for data collection are not serious problems; however, the responsible party must design data handling procedures wh;ch can accomodate varying record formats, recording media, and data columes represented in the totality of the data identified as relevant. It is possible that the savings in cost at the acquisition stage may be offset by subsequent costs of data manipulation for varying formats and record volumes. It must be noted, unfortunately, that data quality is not at the center of most agencies concerns as they approach planning. Data seem to be assessed only on the basis of :How precisely does the data 26 item tell me what I need to know? Can the data be used directly or must it be manipul ated first? Tomlinson (1971 ) notes the significance of dcito development through a comparison of information system component development in terms of the current state of the art for each component (see Figure 1 11). Hardware is by far the most advanced, followed by system software. Data documentation and collection are the least advanced and most difficult part of any information system effort. There are four basic options available for performing the range of technical operations associated with geographic information systems: manual; computer assisted, interactive computer assisted; and automated. No known systems are currently completely automated, so it is recognized that complete automation is questionable as an alternative. Data handling systems must maintain a high degree of flexibility in order to allow easy adoptions to changes in preceived needs by system users. The methods used to maintain flexibility in the various technical operations are the cornerstone of information system development. Flexi- bility is best mainatined by planning for continued system openess (i.e. avoidance of needless automation). For example, if information system functions (i.e., storage of data) can be accomplished efficiently using only manual techniques then a manual system should evolve. In other situations, Manual data handling methods can be combined with computer oriented techniques relegating the computer only to those tasks best accomplished by the machine (i.e., repetitive statistical analysis or map overlaying). The technical options have implications for methods of information transfer, storage, retrieval, analysis and display. For manual systems, operations include interpreting photos, drawing maps and preparing output reports, anaiys is and graphic presentations. Computer associated procedures include encoding the data base into a computer readible form which often requires the addition of specific equipment such as digitizers and interactive editing devices and storage devices such as cards', tapes, or discs. 27 Figure 1.11 Information System Development @SUBSYSTEMS ELEMPITS LEVEL OF DEVELOPMENT Data Definition W+ Acquisition Collection k Hardware Data %L Inaut and System Software jr Storage User Software Information Analysis Use Use Relative order of maqnitude (expressed as proportion development) FIGURE 1 .11 is presented to illustrate the relative developmental stages of the important aspects of geographic information systems. 28 For manual. system options, data retrieval consists of extracting a required map, photo or statistical table from a larger data file based on some criteria for extracting the specific observations. With computer assisted options, retrieval operations are accomplished by software which are utilized for the selection of desired data from the files located in various computer storage devices. The interactive computer aided option includes built in feedback.mechan isms that allow retrieved items to be visual ly.displayed for examination. to, determine if they are to be included in, a special d ata file being created for the intended analysis. Manipulations are carried out by additional software. Outputs, such as maps and tabular listings can be also produced manually or with compiter tied peripheral equipment (line printer, plotter, television screen). Outputs from computer aided options sometimes require manual modifications for use by decision makers but should not require modification for use by professional planners. 3 .2 The Structure of Geographic Information Systems Geographic information systems are not only composed of data and data handling techniques, but are a composite of these and a number of other related primary components. The components recognized as being central to geographic information systems are: (1) data acquisition; (2) software and hardware; (3) the technical operating procedures of the system; (4) the system's relation to its potential users; (5) the management of the system; and (6) system component procurement requirements. 3.2.1. Data Acquisition The data acquisition component of geographic information systems must deal with all the system's data, their characteristics and the institutions and systems which are vital in data supply or in assisting in their acquisition. The data acquisition component of geographic information systems must address the issues of: 29 A. Data Characteristics: data fo rmatting, record formats,, and data volumes location and ownership of data documentation for data classification of data data collection plan B. Data Framework Parameters procedures for acquisition continuity in data flow informal data provision agreements data supplier - data user program coordinator changes as required in data collection C. Evaluation of Data applicibility to the problems of users assessments of accuracy and reliability 3 .2.1.1. Data Characteristics Within systems a number of data characteristics affect their subsequent use and ease with which they are processed. Within the framework of an information system, some important characteristics are: accuracy,. precision, reliability; coverage, scale, relevance; formatf volume, density. A. Accuracy can be assessed by determining the uniformity of data over areas, by measuring 30, uni formi ty of recording medium (maps); and by estimating spatial precision and timeliness. B. Precision of spatial data is affected by the choice of medium and data format, by the geographic referencing system, by the nature of assigned descriptors. Every change in format or scale introduces possible errors and generally a loss of information. Also, precision may be reduced if information is interpolated from data obtained from scattered points., such as with ambient water quality data. C . Reliability is a measure of the quality of data as it relates to a problem and to the assumptions and requirements of specific data analyses. The ultimate test for reliability is simply how the data set corresponds to reality both spatially and descriptively. D. Coverage relates to the geographical - spatial extent and completeness of the data and the content of the coverage (does it have the desired classification scheme). E. Scale refers to the.size of a recording medium for spatial data in relation to real world dimensions. The importance of scale is recognized as significant when data A may be recorded at one scale and archived or analyzed at another, often resulting in the loss or distortion of information. F. Relevance refers to the appropriateness of data to performing a task or answering a question. To be relevant, data must be able to meet specific analytical requirements and meet standards of accuracy, scale, and timeliness. Data characteristics (A) through (F) are characteristics which affect data use. Do they tell the user what he wants -to know, make inferences heneeds to make to perform es@ sential manipulations and analyses ? The following data characteristics are those which affect data handling. That is, these are characteristics with implications for data processing (manual or automated) and they relate to the formats, volumes, and density of data. These characteristics have significant implications 31 on information system design and must be recognized in the initial stages of system design. G. Format refers to both the format of data records and individual data elements. Record format relates to the arrangement of data for capture and storage (maps, cards, magnetic tape, tabular, etc.), and graphic symbols and other delineations used (i.e., legends for maps). Written records, graphic records and machine readable records are the three general types. All three are generally a part of information systems and are useful for various specific data types. Record formats are the observable results of a data collection effort. They define input for geographic inf ormation systems and are therefor a prime concern in all such systems. Data format refers to the mode in which real world objects are represented as data. The format is both the representation of a real world element and a locational identifier marking its position, area, and extent. Data format is important for software development as it affects both data manipulation and program structure. In terms of location data may be point, line or area data. Standard maps, such as US GS quads, often contain a11 three data types, but information systems generally require separate inputting of lines, areas, and point information. Where both locational and descriptive data are to be utilized, the two are separated for computer inputting and then merged internally via special software. H. Volume again refers to both records and individual data elements. Record volume is the number of separate physical records on which data is recorded, such as the number of maps in a specific mapping series. Record volume provides an important delineator of data handling requirements as it is usually a specific number where as the amount of data per record usually varies greatly. Record volume is important in both hardware and software development because record volume can overwhelm a data handling system just by the sheer number of cards to be punched or maps to be digitized, Records volume 32 is important in frequency determinations for handling and in retrieval. A system must know the peak load of requests to access records that it can handle, Data volume is an extremely difficult parameter to estimate, Counts may be conducted on sample areas, but actual volumes are generally not known until actual data inputting, Data volume may be only estimated, but it is important to estimate its likely growth rate for continued system data handling success. 1. Data density is an important contributor to data handling operations. It serves to convey the degree of difficulty to be expected in processing data from map sources. Data density is often expressed as the number of data elements per square mile at any map scale on any map. Generally, technical difficulty and thecosts of data input decrease significantly as data density decreases, so it is unec onomic for a system to accept data more dense than required by system users Data density is perhaps the mo st significant parameter in determining the magnitude of imputs, storage, and other data handling tasks of a geographic system. o2.1.2* Data Framework The data framework refers to the physical existence of data and the ability to obtain and use those data. Procedures for acquisition relates to data ownership. For example, many data are subject to provisions of confidentiality which may impact their use or collection. Also, their are tremendous differences in both the physical location and accessibility of what are often similar types of information. Continuity of data is significant if a geographic system relies on specific temporal or geographic coverages or extensions of coverages. 33 The important bot tom line parameter for data which is available relates to the process of extracting a relevant, reliable data base from the available sources. For example, data series such as soils and surficial geology often require additional interpretation at the time of input into the system . When availablef most parameters relevant to the framework for data acquisition relate to cost. The full costs of making data available and usable must be weighed against the costs of creating new data, When budget constraints permitactual data collection permits the use of data whose relevance and reliability can be ensured. 3.2.1.3. Data Evaluation Evaluating data relates to the applicibility of the data by examining documentation for accuracy and reliability. This step is important because any attempt to use other than error free data files can be disasterous in terms of data analysis, outputs and end use, and will jeopardize confidence in the entire system. "The quality of data is important. One bad piece of data can ruin a good data set. The ability of data to stand as evidence and be acceptable to both sides is the essential criteria for the construction of an information system. " (Bossleman, 1975) The level of effort-which can be devoted to data inspection usually reflects staffing, budget and time constraints. The advantage of secondary data is that often they have already been inspected and interpreted for some use, but, as previously discussed, since this previous use usually will not correspond with the new use, additional efforts aimed at data Idocumentation for evaluation are generally requi red. 34 3 .2.1.4. Data Most Often Included in Geographic Information Systems The emerging problem solving capabilities of state initiated and statewide resource planning and management associated information -systems calls for data typical of more localized sources concerned with land use and natural resources. This data has gener ally been concerned with planning processes related to aesthetic.. ecological, geological, hydrological, physical and socioeconomic values, conditions and trends/ along with. any number of factors that determine the suitabi Ii ty of land for various uses. .Data most often included are: soils, geology, vegetation, land use activity, hydrology, flood hazards, and natural features and to a lesser extent, cadast ral parcelsi streets, roads, and highways, utilities, zoning, census tracts, plans, redevelopment districts, and specialized factors such as moise leve Is, seismic hazards, and ambient air and water quality data. 3 .2.2. Software and Hardware Software and hardware are computer oriented terms which refer to procedures for manipulating data and the equipment that performs tha actual manipulations. An addition to standard software and hardware, geographic systems require specialized procedures and equipment to handle spatially referenced data. 3 .2.2.1. Software (System Procedures) With most systems, there are generally manual proce dures (instructions for individuals to fol low, such as in interpreting aerial photographs) and computer aided procedures (software for data base management and information analysis and display). Software will not be reviewed here but specific discussions can be found in Appendix A and Chapter 5 of this report in an I GU (1976) 35 report to the Geologic Survey on computer I system software related to geographic information. 3.2.2.2. Hardware Systems generally require both manual devices Cl.e., optical devices) and computer associated devices. The hardware for a system must be c apable of handling both cartographic and attribute datal allowing for its entry, retrieval, examination, and often modifications. in this brief section, the concern is with introducing special hardware requirements as an introductory exposure by the reader to automated central processing units, computer.memory alternatives and standard input-output devices is assumed. It is recognized that many geographic information systems in partial use or development, do not have any special hardware in that they merely use terminal displays on large time-sharing computers. But many new systems are cost effectively using certain types of special display and edit, drafting, and digitizing equipment and these are the focus of this discussion. The Digitizing Sub-system. Digitizing,the process of recording the areal dimensions of spatial data in computer readable form, can be done manually and automatically. As well, both grid cell and polygon systems can be digitized manually or using automated techniques. Grid cell digitizing can merely incolve the placing of a gridded overlay over a map and reading along each of the squares, allocate values for the parameters being considered to each square. However! a similar output can be gained automatically if the map is scanned by optical or vaster scanning devices. In digitizational of polygon data line digitization is the most common method employed. For map work, free penci I digitizers are most effectively used, in which the scribing device 1 This discussion is based heavily on I GU App@endix C and Geographic Data Software, I GU, 1976 800+ pages. 36 W 0 0";v kl ........... vi 31. INV x -41 n w 4 bqgf @ 'jW 7,." -i "k ro., @p @Pg -411 kr -Tu, 4, 1N 'y' Rt, '@5 M 'MR 5 11V.;@ i 7@' A `--, NI F- lt,i'k 01 pi, 11'1i,@@y J k k @,N vAl I- M ?: Figure 1.12 Example of Digitizer Operation (electric pen which records point coordinates along lines) is not mounted on an arm but can be moved and placed anywhere, the only restraint being the electrical connection. The alternative is to mount the pen on a trolley on the arm of a drafting table (see Figure 1 .12). One of the problems in off-line or non-interactive digitizing is there is no way to check the data as it is being recorded and there is no way to efficiently correct mistakes. The addition of a minicomputer controller to carry out the checking, and the reporting and erasure of unwanted points adds greatly to confidence in the data and the reliability of the entire operation. This cost is only slightly greater than systems with no minicomputer control. Automatic line followers and drum scanner digitizers are also available but are not widely utilized. Automatic line followers contain a viewing photocell which is carried by an X, y increment motor; between the photocell and the field of view is a segmented disc. The relationship of the! disc to black lines on a document serves as the basTs- for x, y, motor instructions for line following. A-. number of hardware (problems in automatic accuracy checking) and software (difficulty in locating islands) have prevented the widespread use of line followers in cartography. Raster scanners are more common and these involve a line by line sweep across a display surface to generate the map record. One basic problem with scan d igitization is the difficulty in converting from raster to vector formats within real time limits. Also there are few user programs which allow this to be done economically and efficiently. The difference lies in the fact that raster formats involve holding the data as hori- zontal lines and vector formats involve actual line by line encoding and storage. So most scanning 1 devices output directly onto magnetic tape and the conversion is carried out off line. Because of the great bulk of data often encountered, this conversion is a major data handling issue to be addresse However, the Canadian Geographic Information System has been operating efficiently in this manner for a number of years. The edit subsystem often consists of one or more displays associated by a minicomputer and 38 controlling software. Such self standing edit units currently cost between $25,000 and $100,000. The purpose of the edit subsystem is generally verification of data which has been either digitizing or scanned in terms of completeness, accuracy and cleanliness. To. properly accomplish this a first requirement for hardware should 6e.a self-stariding interactive display system-A second requireme nt is a manual digitizing capability to carry out modifications of cartographic data. The choices of display equipment for the system are numerous. The most widely used display is the roster refresh scanner which produces images by drawing horizontal lines on a television screen. Various vector type plotting systems are a6solutized. Other types of storage displays include loser @screens which are large television screens with very clear picture images. Such systems best operate when they allow interaction with the display for correcting, updating and modifying the displayed information so storage of some type is generally an integral part of the edit subsystem. The drafting subsystem is required to produce output display. Most display systems involve drafting units and plotters operated for off-line cartography and many such units contain mini- computers for control. Drafting units produce cartographic lines drawing in a con tinuous line mode and lower quality drafting units are referred to as plotters. Flat bed plotters and drafting unih look like ordinary drafting boards and drum type machine is a revolving drum with an axial motion for drawing. Generally, drum units are preferred because of ease in manufacture and maintenance and better speed response in operation. The quality of the map wh ich is obtained from either unit type depends on the quality of the mechanism that determines the size of steps in the x and y increment motors. This -,imply refer,; to the drawing pens which are driven vertically and horizontally by separate motors relying on these motors acting simultaneously to produce angular or curvilinear line pattern;. Microfilm plotters, electrostatic pointers, electron beam recorders and light heads are also available but few systems have currently incorporated ei ther because of hardware expense and complexity. 39 This discussion is both brief and incomplete but it was only intended to point to the range and functions of important peripheral equipment types. The next section will deal with digitizingf editing and outputting functions in much greater detail. In summary, most of the minicomputer systems curren tly being utilized in digitizing and editing functions have cost on the order of $100,000 -for complete equipment and software. Over a five year period, 1 000 hours per year, this would mean an hourly rate of $20/hour. This can be compared to an hourly rate of $250 + for large time sharing computer systems. When it is appreciated that data handling functions and search operations are as fast and efficient on minicomputers, the economics of the smaller units is appreciated. As well, a review of existing systems seem to show that systerns with smaller in-house systems have much faster turn around, which must be considered if visual interaction with large data bases is desired. 40 3 .2.3. Technical Operations of the System 3 .2.3.1. Introduction Technical operations relate to the total set of oper ations which must be performed on data to achieve requisite products from the system. Data input functions cover all operations needed to develop an error free data base ready for analysis, These operations are important because documents used to create.data files are error prone and in the past, human interpretation has been the primary data check. As well, data inputting covers processes which relate to the analysis and transformation of data to bring them to a standard base. So data manipulation at this stage only provides for checking the original information and placing it into a computer. ready format (see Figure 1. 13). Th enext group of techniques relate to the manipul ation of data to change it before some analysis. This is important because the needs and requirements of many analysis techniques require that the data be transformed to a format suitable for that analysis technique. All ma"jor geographic information systems contain the capability for data manipulation prior to analysis. The third group of procedures relate to data retrieval and analysis. These procedures encompass management of locational and attribute data, retrieval from the data base, and any procedures for data update, further correcting, reformulating or deleting. The final group of procedures are information output procedures. These relate to methods and formats of tabular and graphic output and to validity checks and veriFication procedures. 3,.2.3.2. Data Input Functions The first step in data inputting is an evaluation of the quality of the source material. Data documentation should be available to make the following 'judgements: 1) is the source. acceptably error free ? 2) Are there extraneou s data ? 41 FA ACOUIRED, INSPECTED, INTERPRETED TAI MAN UAL ORGANIZE DATA TRANSFER IN USEFUL FORM AND INDEX MAP AND STORAGE DRAWER FILES 114TERACTIVE MANUAL COMPUTER-AIDEO COMPUTER-AIDED f ENC ODE ENC ODE ALL TRANSFER ALL DATA DATA Data Base Management T APE/DISK STORAGE TAPE/DISK --- -------- ------ - - - - - - ASSEMBLE1 SELECT AND SELECT AND RETRIEVAL DATA BUILD WORK BUILD WORK FILE FILE Data Retrieval DISPLAY and Processi-ng MANIPULATIONS MEASUREMENTS1 EASUR MENTS MEASUREMENTS COMPARISONS M PA R I S 0 N S COMPARISONS DECISION - MAKING PRODUCTS /DISPLAY Figure 1.13 Geographic Information System Options (source: Tschanz and Kennedy 1975) @AN A; ER I @VS E 42 3) Are there missing data? If requiredcorrections are usually performed before actual data entry, Correction is a manual process and is likely to introduce additional errors and failure to detect them can lead to a very costly editing process after graphic to digital conversion (see Figure 1. 14). Many source documents require reformatting before conversion from graphic to digital data. The nature of the conversion process, particularly the level of automation, determines whether or not reformatting is required. An example of thi s is preparation of line maps representing polygons on stable base materials to be'used on an optical drum scanner, If the graphic to digital conversion is likely to be error prone, the source material may be reformatted to avoid new errors. Reformatting may also be required to simply separate geographic data from attribute data. Once checked and properly formatted, it is important to estimate data volumes and relate them to formats to be utilized. The desired relationship between real world elements and their data representation must be studied to ensure acceptable levels of information loss. Before data entry,initial classification schemes must be developed using inputs from existing standardized schemes (i.e., standard statewide land use coding), user needs assessments, and actual characteristics of the data. Where flexible! a choice must be made concerning coding scheme, and the advantages of different schemes must be weighed as the complexity of data manipulation procedures are strongly affected by the nature of these codes. The final step before actual data entry (digitizafton) is partitioning the graphic data base. Source documents must generally be partitioned or combined into optimal working units. This step involves locating boundaries between records, determining appropriate coding, and developing procedures for linking records if spatial aggregation is required. The next step, digitizing, is the'major data transfer function in setting up a geographic 43 Acquire, Data Source I I I' k, V Documents Evaluate Step 1 Documents N Step 2 OK? N Edit? y Correct or Redra,;;r y Reformat Step 3 Document -4: for Encoding Graphic -to -Digital Conversion Step 4 Image Descriptor Data . Data E dit Encoded .4 Data Massive No. of Limited Correct Step 5 Error Level rrors? Errors Errors Error- Free I Step 6 Organize Image and Attribute Data into Useful File Structure Figure 1.14 The Digitizing Process (source.1 GU 1976) 44 information system. Digitizing is simply a sequence of tasks for encoding spatially oriented data. In its narrowest form it can be viewed as determining x, y coordinate values to describe the location of points, lines, and areas as they are depicted on maps. In a broader sense, digitizing can be viewed as the task of creating and acceptable, machine readable, data File fi-om a variety of sources. Figure 1. 15 illustrates a generalized process whereby a useful data file is created with digitization the necessary element in conversion. The figure illustrates the paths that may be followed to computer format geographic data. Actual data encoding involves separate tasks for encoding geographic and attribute data. For image or area data there are: A) identificati on and selection of the entity to be encoded from the data source; B) determining the location of entities by measurement with either manual or automated techniques; and recording the locations on some medium such as cards, tape or disc. For attribute data the tasks are: D) for each entity, recording on machine readable base, the codes for the various of th e attributes; and E) recording corresponding identification codes for linking geographic boundaries to descriptions for those boundaries. The options for identifying geographic entities are relatively straight forward, Point data usually describe separate entities while grid or polygon data rarely describe completely homogeneous areas, So explicit d ecisions are required on assignment of values and boundaries. The amount of descriptive information retained must be weighed against the problems inherent in the handling of various classification codes. 45 SPATIAL DATA ENCODING I Heal-world 2 Observation & 4 Graphic 6 Non-Coded Entity Measut-cment Format of - - - Conjputcj, Process Source Data Storage 3 Syrn olic -Op Vol-mat. of -4-Digitizing Source Data Digitizing-_`\ @A T %0 5 Digital 7 Di ital Machine- 9 0 Readable Computer Digltlzing_:::@ Codes Storag Conversion Processes: Current Practice Increasing PractLce Probable Future Development Figure 1.15 (sourced GU 1976) d f ncoding descriptive data, the encoding of graphic data is a much more Corn pa re o e complex process, as when complex spatial entities must be described, the location identifier component of data can become cumbersome because of large data columes. There are various methods to lessen these problems, however, it is easy and logical to represent point data by points in computer storage but it has been found convenient to represent irregular area data types by grouping information into arbitrary grids. This process involves a loss of information in order to achieve an easier data management function. Most s ystems in operation are based on grid encoding but some newer systems are now acquiring the ability to handle multiple data formats including line, point, and area data. The points codes for point data are generally recorded as x, y coordinates and coding can be accomplished by either completely manual means or with digitizing. For large computer it is typical to use a digitizer that is capable of recording x and y values for points assisted systems i used by an operator who moves th e digitizer point over each point on the map and depresses a key which automatically records relative x and y locations with respect to an arbitrary origin. The optimal line encoding method is to encode each line segment by identifying end and curvature points with x, y coordinates. Data in this format lends itself to specialized analyses such as network and flow analysis.. Encoding data to grid cells has been extensively done in the past. Typically, an arbitrary grid is defined, then overlaid on the map area to be encoded and then manually, semi-automatically, or automatically record the appropriate classification codes. Although grid cell encoding always results in a loss of information, the loss is not uniform and may not affect utility of the data. Loss, of informati.on relates to both grid cell size and the manner in which, classification codes relate to the data and how they are assigned. Not to be confused with grid cell, encoding a number of systems convert point, line, and polygon data to gridded formats for analytical use even though they were record- ed in other formats. This is because the computations associated with the overlay of grid data are 47 relatively simple when compared to other data forrnats. For areal data represented as polygons, the optimal encoding method is to describe polygon boundaries with strings of x, y coordinates to define approximate polygon boundaries. For most applications, the polygon is utilized because it leads to the least amount of information loss. Ideally, of course, systems should have multi-format analytical capabilities because such systems can measure and compare data encoded in different formats. The final aspect of data inputting is data structuring and storage. The important parameters associated with data storage are the medium (cards, tape, disc), the organization of data (file structure), and the advantages of various types of organization, as there are four basic modes of digitizing. The distinguishing characteristics are the amount of capital investment requires, speed of digitizing, degree of error proneness, volume capacities and the amount of labor required. Table 1.2 presents four basic methods for accomplishing actual graphic to digital data conversion,, The digital data files that are created will always require editing. The kind and rate of errors are a function of the digitizing method. Different digitizing methods create different general error types so editing requirements are generally very system specific as there is no list of typical errors which applies to all systems. In contrast to geographic boundary data, attribute data encoding is a straight forward process. Three important aspects of attribute encoding are: I ) accuracy of the coded data; 2) standardization of classification codes; 3) structure of the classification code. These three significant parameters relate to the fact that the advantages of various coding schemes must be weighed as the complexity 'of data manipulation procedures.and storage file structure is strongly affected by the nature of classification codes. 418 GRAPHIC -TO -DIGITAL CONVERSION TYPE* METHOD CHARACTERISTICS PURELY Manual conversion to digital format Minimum capital investment, slow, TVIANUAL or Key punching of symbolic records, prone to error, not suitable for or both medium or large volumes of data, labor intensive, and highly selective MANUALLY Point & Line '.1'racing Digitizers Low to medium capi 'tal investment A I D E, 11 Batch (for digitizing graphic ($ 1OK-80K), high selection capability, MACHINE' rccords with direct JIL11-11,'111 prone to error bul can have high assistance) editing capability, not suitable for Interactive large volumes of data, high labor requircruents, and nioderate speed Sli" M1 - Automatic kine J@ollowhLfL Medhini to high capital Investnient, ALIT(WA11C ($300K+), low selection but good editing capability, no hunian errors, suitable for large voluines of data, low labor requirements, and relatively fast AUTOMATIC Graphic Scanning Devices High initial capital investment, low OCR (Optical Character R ecognition) error rate, low capability for intelli- gent data selection (1976), suitable Digitizing at Source (direct from for large volumes of data, low labor sending devices) requirements, and fast Increasing Automation A continuum of methods can be recognized, wh1ch extends from those that are basically manual techniques to those that are highly automated. Four arbitrary subdivisions, of this continuum are given above. source: IGU 1976) The first step in data entry is partitioning the graphic data base. Source documents must be divided into logical workable units. This involves locating all boundaries between records, determining appropriate coding and developing procedures for linking records if spatial aggregation is required. The next step, the encoding (digiti zing) of graphic and descriptive data is the major data transfer function in setting up a computer oriented data handling system. The process involves recording geographic boundaries and descriptors. For each type of spatial unit (lines, points, grids, or polygons) the options for coding descriptive data (i.e., a soil or land cover type are as follows a single value for the predominate characteristics; recording the distribution of elements across the unit thus showing proportions; recording the presence or absence of entities; or recording the proximity of elements to other elements. The final data inputting step is to organize the data into useful data files for storage and easy retrieval. For large systems, there is a need for a number of techniques created just to create, maintain and retrieve data from storage. Such programs are referred to as data base management programs and will be discussed in the next two sections. Without becoming to specific, this area of discussion is important because data organization in computer memory greatly affects both the efficiency and complexity of retrieving and analyzing data for its conversion into information. For example, overlaying a number of maps with a certain file structure may be a trivial effort while another structure may require extensive programming. The simplest data organization is called sequential in which one piece of information directly follows another. An early structural variation on this which is seldom used today is direct storage where enough space must be saved for the largest variable even if all other variables have smal I data volumes. As well, if data is sparse a large number of potential slots may be wasted. Inverted file structures create separate tables for each entry (see Figure 1 .16). Data is 50 am m Viyuro 122.21 Figure 122.22 SC(ILlential Storage Inverted Storage of Grid Data of Grid Data 1,1 yes 0.2 ...... 1 1,1 yes 1,2 9.6 1,1 1 1,2 9.6 0.5 ...... 2 lip no 1,2 2 lip .5.0 2,1 no lip 6 lip no 5.0 .92 ...... 6 2,2 yes 1,N 17.2 2,2 4.1 2,1. 4 I., N 1.7.2 .64 ...... 3,1 110 2,1 no - - ...... 4 2,N 1.3 2,N 3 2,2 yes 4 .1. - - ...... 3,1 4.2 m , N1 2 2,N - ...... 3 M,N 110 3, 1 no 4 .2 - ...... . . M, N iio ...... 2 Figure 1.16 Two Types of Grid Data Storage (source I GU 1972) stored by columns which wields a good deaV of flexibility as wel I as being faster for operations such as overlaying but are also extremely complex in their arrangements. The above structures are applicable'to grid formats because of fixed lengths for each record but polygon data is difficult to arrange into'standard size units. With polygons, grids are not used and data density from data type to type may vary greatly (i.e., census vs. soils data) and polygon size varies greatly, so in polygon systems there are generally both single short records and very lengthy complex records which must be combined, overlayed and analyzed. The difficulty arises because generally computer storage is linear. Approaches for polygon storage often involves iust listings with data positions signified with various types of pointers. Pointers are values which are organized to easily locate data with complex structures. With pointers for referencing the actual data, items can be added, deleted, located and retrieved easily. Numerous pointers are required (one for every data element), but the trade-off for being able to easily reference variables has proven to be a satisfactory one. In general, but with exceptions, the process of data update is simplest for simple data structures and more difficult as complexity increases. However, this depends on the nature of the required update. In many systems it is eas ier and faster to add a data item than it i's to delete or change one. The cost of update can usually be decreased for most structures if the updating can be botched, that is, if the data to be added or changed can be collected for a period of time, and all changes made at once, during a single run of the update program. From an information systems users the probf em of data organization is to select and design a file structure which achieves the appropriate balance between considerations of flexibility, transferability, reliability, response time an d costs. One should note that cost is composed of several factors: storage volume, amount of processing time, the required computer, costs of development and time costs. The last two costs can be decreased by using existing systems or parts of systems 52 whenever appropriate. With modern computers, there are a number of storage medis which differ mainly in their relative speed of access and cost. The basic relation is simple: the faster the access to data, the higher the cost. Thetwo common forms of storage outside of core memory which is in significant are discs and tapes. Tape provides a low cost sto.rage medium which is slow because tapes must be mounted. A system oriented toward non-interact ive inquiries, where reponse time' between the initiation of an inquiry and production of results may run from hours to days, can make good use of tape storage. However, a system which is required to provide quick response to a wide number 0 f users on an interactive basis must maintain a data base on disc in order to avoid major delays. Table 1-3 presents relative cost trends for on-line storage. 53 able 1.3 TYPICAL COST OF ON-LINE STORAGE TECHNOLOGIES (or@fe-r-of -magnitude f igures) Cost includes transport mechanism cost Size of Storage TYpe of Storage Cost per Bit (millions of bits) Access Time Register $10 0.001 0.01 microsecond High-speed cache $1 0.01 0.1 microsecond Main memory 10 cents, 1;10 0.5 microsecond Large core storage 1 cent 10;100 5 microseconds 0@_-um and fixed-head disk 0. 1 cent' 10;100 0.01 second 100; 1000; 10, 000 0.1 second Movable-head disk 0. 0 1 centi Magnetic strip 0. 00 1 cent 1000 to 10, 000 1 second "Tass (archival) store 0. 000 1 cent' 100, 000 to 1, 000, 000 10 seconds Vter: Martin, 1976) 3 .2.3.3. Data Manipulation Procedures This set of procedures is required to establish a required bridge between data entry and data analysis. This step is necessary because it is' often important to change data from a format which is efficient for storage to a format which is efficient for analysis. For example, data may be efficiently recorded by lines and vertexes but the system' may require data to be recorded on the basis of cells; for efficient analysis. A flexible system wil I contain capabilities for a number of manipulations to convert data to formats that provide efficiency or various analysis procedures. Three types of manipulations are most important: (A) methods of projection change;(B) methods of mode change; and (C) the joining or separating of data records. (A) Many map projections have been developed and a number are being used in spatial information systems. The ability to change data for a file from one projection mode to another is a desireable system capability. Generally, a system must be able to handle free x, y coordinates, geodetic coordinates, and plane coordinates. Projection change using the computer is becoming a relatively simple process. Transverse Mercator Projection, UTM-Coordinate System, and the State Plane Coordinate System are the primary geo-referencing systems in the United States. 54, (B) Format or mode change involves the operations of, reclassifying descriptive data, changing data and merging similar value records. Important aspects in this are changing line data to grid data. This operation has proven difficult because of the required complexity of programs for interpolating grid area boundaries from irregular p olygons and uneven grid borders whichresult from polygon double digitization. (C). For analysisj data arrays must frequently be broken down into sections due to limitations 'in computer memory size and ana-lysis procedures. Records to be separated or joined are generally rectangular and the same size for keeping the math involved relatively simple. When data sets are grid rather than lineal or polygon the problem is trivial. For polygon data, data strings must be tested for bo undary violations once joined and this is a relatively simple task. The entire process though relatively simple is critical because a small amount of mismatch can produce critical errors once separate maps are joined. 3 .2.3.4. Data Analysis and Retrieval Retrieval refers to locating a file (data set, series, maps, photos) within the storage of the computer and extracting desired data from the retrieval file. Data analyses are the set of operations performed on retrieval data to produce information. Retrieval and analysis procedures include- retrieval from storage analysis on locational identifiers analysis on descriptive data analysis on locational identifiers in concert with data elements The retrieval andanalysis parts of computer aid spatial information systems appear to be quite varied, but there are several generic classes of computer aided operations which will be discussed. Basic procedures involve data classification and agg regation. Though systems should ideally 55 accept data completely unaggregated, the amount of data would be staggering. The basic principle central to having workable amounts of data in generalization. This onvolves the categorization of different data elements into large more homogenous areas of differing levels of specificity. (For example, a statewide soils data base may contain soil types which have to be aggregated into soil series for analysis because of data volumes.), Class intervals., defined categories and defined locations are all tools for data aggregation. Some important data analysis procedures which are common to many systems are: A) Search - this is used to locate single items; sets of defined items; locate undefined items; locate items based on data linkages; locate items based on criteria in the data set; and search by location. B) Contour - i's the process of drafting area value maps from point data. Q Overlay - consists of matching different data sets for the same area to produce composite overlays. Grid overlay is generally used but polygon overlaying is developing and most systems currently being implemented or designed are polygon oriented. Other procedures which are often readily applied are contour, smoothing, interpolation and extrapolation, trend analysis, path search analysis, and area calculations. 3.2.3. Data Output and Display The output from spatial information systems need not always be a spatial display for numerous simple land resource associated models need nothing more than a tabulation of numbers. For example, a comprehensive land resource information system might use the following outputs: A) Production of graphs, histogramsf etc., showing relationships between variables and indexes. An example is a graph comparison of existing land uses at different time intervals. Systems should be able to produce such information interactively. B) Spatial plots defining existing and historic conditions (economic, environmental, etc.) 56 of anywhere in the geographic data base. Q Density plots showing information such as population or levels of agricultural production. D) Overview plots wherein topography is shown and land uses are plotted on the topography. E) Controus or isotherm plots to illustrate parameters such as degree of ai r pollution. In summary, map and tabular displays form the primary building blocks for geographic. information systern- utility. The. advantages and effic @ency of various displays are well documented; however, the versatility and timeliness of comput er graphics will earn an expanding role in the presentation of spatial information. Good computer displays have to only present the most pertinent inforTnation since additional information can be retrieved from a good system quickly and inexpensively. (See Tomlinson, 1972, for extensive review of system graphics.) 3.2.3.5. Technical Procedures Summary Both polygon and grid cell encoding are widely utilized. The polygon form is generally used in systems where accurate cartographic retrieval is important and the grid system is more widely used where the ability. to overlay efficiently is essential and where a range of demands on the system are generally well defined. Table 1 .4 displays the advantages and disad Vantages of the geocoding options. Interrelationship between the level of data aggreg ation, system requirements and analytical tasks related to data use and geocoding are presented in table 1 .5. One of the major problems in encoding spatial data is the lack of me' asures which can assess the effectiveness of the encoding. Comparative measures need to be dev elop ed that can make quality control determinations in the process which extends from source document, through capture, to use. As well, there are few effectiveness measures that relate to processes of updating, edge matching and retrieving data. 57 Table 1.4 CHARACTERISTICS OF GEOCODING OPTIONS Geocoding q2tions Grid X-Y Coordinates Simple Polygon" COMPICA P0lYq,;'116 Predonitrinnt 911111tLettive Area Mona. Polikta 1,11113 @;vrj11j41jjt (fi!w vurticvn) use small colla, Prnienco Distribution Point location, Linear dnta, e.g., Regular areas, lrregulav arcivi, Isrodominant 0Y abnancn of unou area controid, ntrcums, roado e.g., jurifidic- o.q., iwil, covurt 1191n or n-awron uniaplo poinLu iyj)o or owt.wvric or tiolkal boultdark(lu, Liloptl firt11111, 1111al UVAS )to covorn-job cvjjqAt.x covoralles oncoding Uf 1111OLI and Aroau Encoding Goannor Res(:ord printary, Manual Point digitizing Vuhkt or astroam, VoinL digitizing Stroom #UqtLixiny methorl n.-cond'-wy, and grid, grid d1.9itizing wiLlt or autoltvitig 11110 (-n tertiary of Volygon wiLliout arna coding fol loving OD occurroncas data AdvanLaqoa Ono category llxplioit oncod- rxplicit oncoding per coll rnroda till data once Minimal Replication of ing of area data of complex area [ot: aAch call encoding Untiar JaUt for data cdrto' -IrAPIAU purpoDA-41A MUSt procCE13 1)01Y904% data to Dinadvantagoo Separate ancod- Exact coincidence of Boundaries Line segments onuuro exact maLchinj of Ing for each categories not ancode4 not encoded need additlonal adjacent polygons covoraiju many processing to calls generate polygons (source Duecker 1975) M MAN M M M M M M Table 1.5 LWEL OV DATA ACCMGATI014, SUTI;m W@QUIRX.'CNTS AND ANALYTICAL TJLSKS AS A EUNCTION OF DATA USUAL'I'LICATIO30 AND GLOCODING OPT103,1 ExtornAl Geocoding options Index Grid )-(,Y Coord (Itq Placo Nuna FinQ Polygon Arua Coarvo Polygon (many vart:jcas) or Codo Predominatit RM2L!j!2@@ Maitsurar.'Int voialts Lino Sa:nont -tfcjw vertices) Grid P Y-9-0*1 Wtu-ork Policy Planning S11 AO, Ml Al Sl,Al,H3 Program Planning Sl,A2, M3 a3, Land Inventory SI.AS,N4 Sl,A4,M4 S2,A7,M4 S2,A7.,M4 S2,A7,M S3, A7, M4 ImIjact &USeuGme"t M4 H4 E2 X2 IH4 IA4. A7,m M5 X5 A3,MS S2,V, M5 S3, AS,V.5 10 Land Capabil Iity S2,A5,M4 S2,A7,H4 .S3,A7,M4 Regulation A7,V.4 33, A8,M6 YWY I 'QUIRL SV',TL:1 1U, NTS DAM LLIvur, ol, AGGRrGitTION ANAVITICAL TASKS TWI-PtL @pose cumputor AO State It and Ca rd . t, filca Al County otkd projoction modole Celleral data manaqement sof tware AZ VURQr Civil Di.v1slon/Traot M2 OptiiiLLI location/allocation, m9dols 140 L;Pucialized Pcriphoral equipment A3 Erumaration Diatrict/alock, KI Gl%-tiisl Intexactlon models Group H4 Spatial associatio.1, measurement, And dinplay techniques S2 General purpose computer A4 Square mile/si@uaro kilometer M5 Diffuaion models Dig1tizar, plotter A'S 40 acr@z/9 hectare X6 Racard keeping and monitoring systems Specialized software for data capture A6 one acro/one hectare A7 natural areas RT, plotter Ae ownership parcal S3 Dadicat-.d computer with digitizer, C. Speci"izod sofEwaxa for data capture (source:Duecker 1975) 1 .2.3.3. Users of the System The utility of a spatial information system in meeting perceived resource management needs is the justification for a system's existence. However, even when systems do meet perceived needs, there are additional obstacles to complete system utility. These obstacles include: the potential users perception of the system; user's inability to express problems in quantitative terms; familiarity and bias toward more traditional techniques;, and perceptions of costs and benefits of the system when compared to other alternatives. At the absolute minimum, a users relation unit should be established for any system which should be managed by persons actually associated with system development. User relations include user needs, user access arrangements, user training and formal arrangements for the system to react to user feedback. The following elements should ideally be a part of any system's dealings with its users User Needs perceived data elements perceived manipulative capabilities required outputs User Access direct user access technical assistance formal user organizations User Training system use approaches to quantitative problem solving expectations of the system use of interim products during development 60 User Institutions user committees for system evaluation user input into policy formulation on system development, modification, and operation User access to the system is the primary interface since this is where the level of actual system utility is determined. Depending on user experience, the interface can be through computer oriented technician and the sophistication and complexity of the User system interface must increase wi th the sophistication and complexity of the information system itself. A usertraining program must include system access methods and the user must be trained to approach problems with a quantitative perspective in order to understand the range of capability being offered by the system. The problem with most systems currently operational is that system development programs have not operated in time frames that allow extensive user training. As well, meaningful user feedback should follow from the various aspects of the user interface and training. 3.2.3.4. Management of the System One of the most neglected aspects of spatial information system development is system management.. This is not limited to managing system development but includes the entire management perspective for complete operation of the system. The institutional setting for a system is important because many systems have started with external funding and many funding sources cannot be relied upon to ensure survival of the system. General policy statements are imporiant and must address issues such as mode of operation (i.e., schedules, priorities, classes of service),interagency agreemen ts, and data confidentiality. Several kinds of planning activities are important in the management of information systems. Planning program staff and staff changes is important. A major concern should be detailed job descriptions identifying all the skills and capabilities required for operating a system. 61 A long range fiscal plan is needed because obtaining funds for system initiation has proven to be relatively easy and funds for system continuance or upgrading are generally much harder to obtain. Three vital areas must be addressed in a fiscal plan: total funding; development costs separate from operational costs; and fiscal continuity. The problem of development costs versus operating costs must be examined rather closely as the distinction bewteen the two is not always clear. However, a distinction between the two should be stated, even if arbitrarily, and should be based on funding source, system operating policy, and expected user community considerations. In the past, as well as presently, funding continuity often hunges on the delivery of interim products, either for actual use or for system publicity. In summary,. good documentation for of I aspects of system management is essential and should include: the system development plan the staff plan the fiscal plan operational cost information technical software descriptions user education programs user manuals system use records all interage ncy agreements. 3.2.3.5. Procurement of Hardware and Software The final element of geographic information systems is the procurement of hardware and software which generally involves preparing detailed specifications and evaluating alternative ways of performing the technical tasks Procurement of the data handling equipment and associated software can involve the following: Hardware competitive bidding evaluation of proposals 62 Proposal. check-out procedure performance records maintenance records Software acqui re existing packages software testing development of new software. 3 .2.3.6. Sum mary of.Sec.tion III Section !TI has presented a brief introduction to the structure, scope, and potential utility of geographic information systems. As well, its purpose is to present terms and definitions which will be used in the remainder of this report. Essentially the chapter should show the reader that geographic information systems entail far more than computerized cartography. Many planners have looked to simple computer mapping as the answer to their problems when in fact far greater problems with data collection and management may be facing them. Geographic information systems in their fullest sense provide cartographic capabilities but also the opportunity to construct efficient logical frameworks for data base development and management and the coordination of natural resource information users. 63 SECTIqN 3A THE DESIGN MEHODOLOGY 3A.1 Introduction With geographic data handling systems, most of the important developmental work should occur prior to the actual design of the systerh to allow systematic and logical approaches to problem solving if anticipated system capabilities are to ever be met. In the early stages of development the approach should be independent of actual data content and the constraints of hardware and software so that when developed, the system can be applied to any data that responds to the anticipated needs of anticipated users of the system. Geographic information systems are not ends in themselves but are only tools for planners, managers and decision makers. The economics of such systems are a function of the intended purposes of the system and the actual level of use of the information produced by the system. For a system can reflect ultimate technical efficiency and not be a success unless the needs of users are being met at a reasonable cost. The most difficult aspect of system design to accomplish, because it has been essentially ignored in the post, is the identification of the critical aspects of the interface between the capabilities of a system and the various decision making processes of potential system users. To better understand these processes design must focus on the concurrent development of a number of system sub-systems-data acquisiton, data processing, data analysis as a response to user data analysis needs, and integral management planning of all aspects of the systems operation. All of the many stated purposes of geographic information systems can be generally summarized into one primary objective-the conversion of raw data into usable information that is actually utilized. The use of any information from a system begins with the final output 64 being transferred from within th e boundaries of the geographic systern. to outside the system-the domain of each individual user of the system. Therefore, system design should deal with issues which are actually beyond the scope of the system in order to understand user decision making, user data and information requirements, and required desired and information uses which will be central to the activities at the various user communities. 'For the true benefits of using an informati on. system is determined by improvements in the performance of user decision making which are achieved directly by use of the system. 3A .2 System Economics and Design Marble,et. al. (1972), state that "the economics.of geographic information systems use is quite simple: are the benefits worth the cost? " Costs are basically a function of what the systems capabilities are and how well those capabilities are executed. The components for deter- mining true system costs are twofold: the direct monetary outlay to design, build, and operate the system.; and, secondly, are the intangible costs related to the legal, political, and social realms of system functions. Such costs simply relate to the question: are there potential benefits to be derived from implementing no geographic system at all? For the first cost, data collection and inputting costs., system development costs, system operating costs, and system maintenanc e costs are all generally quantifyable for entry into the decision making process. The second cost,.however, presents a number of considerations which at best can be only examined and estimated and perhaps only inferredi but which are significant nevertheless. Related to these costs are assessments of information system use and potential use once implemented, which are extremely involved and complex assessments which are possible only through understanding the decision making processes of the various potential users. However, to date geographic information system use has generally not advanced to where these user oriented 65 parameters are generally observable and quantifycible in terms of actual reliance on the various systems currently in operation as vehicles fo,r imporvement in user decision making. This previous discussion i's important because the design approach to be outlined and initiated in the remainder of this report is based on the assumption that the economics of implementing a geographic information system are conditioned either positively or negatively by the design and development process. The first step toward maximizing positive system economics is the development of a systemati c, long term, and interative design process. The end result of such a process is ideally a matching of information system output capabilities and the requirements of the community of system users. 3A .3 Initiating A Design Process The basis for the development proces s presented here is basically an interat ive design process in which a range of alternatives are thoughtfully derived and then tested. Alternatives are weighed against system objectives before significant commitments are made prior to actual system design. Quite.logically such a process begins rather generally with decisions of greater technical complexity and detail required in successive iterations of the process. This report in the context of the proposed process is regarded as being part of a first iteration and broadly covers the various concepts of perceived system need, a general review of data and other resources which are available, general system specifications derived from new continuing research priorities. Later iterations will represent the first real visible attempts to establish a geographic information system for Maine, should this process continue. By this time, the apparent needs and objectives for development should begin to surface to the potential system sponsors and advisory planning groups which should be actively involved in the design process once it is truly underway. 66 3A.4'Design Methodology The proposed framework for the design of a geographic information system is based on the assumption that.such a sysItem consists of: (1 ) al I potential system participants.; (2) a processing capability that often involves computer hardware and software.- and (3) data. System performance, recognizing these three elements as being part of the systemj ca n* be affected by: (I )'the manage- ment and administration of the potential system., (2) the legal-institutional status of the system- and (3) actual political response to outputs from the system. Recognizing these system components a three stage information system design methodology (see figure 2.1, Calkins 1972) which includes six integral steps is proposed. These steps include: the determinati on of system objectives; an assessment of data availability; an assessment of pro- cedures, hardware and software which are available: a compilation of general system specifications; generating alternatives; and selecting a system or specifying a set of procedures that will satisfy the objectives of the system tothe maximum possible extent. Each step in the design process involves several types of decisions which have to be made, each of.which relate to one or more decision variables. Such variables are regarded as decisions which are made in the process of designing a sy stem. For example, for each variable at every step there are generally several options open (i.e., storage or data on disc, cards, or tape and selection ,of map scales for inputting-]:] 00,000 to 2:24,000) and each decision must be made explicitly recognizing the' numerous interdependancies between such decisions. .If the six primary steps from the process outlined in fig. 2.1 can be aggregated into the three stages then this report represents a first iteration of stage I . The systems approach delineated in figure 2.2 presents the general.steps which must be a part of each iteration in the design process. Because the process is viewed as being systematic there are required feedback mechanisms which must be eventually structured and monitored as part of the process. Figure 2.3 presents a general 67 1. 4 Inventory oxistilig (Intit UoUrc OU luld- STAGE I collection Ore ri 110 1. 0 Describe data out 1. 2 Describe specifications and evalunte STAGE, a 1. 1 Deficrillo data needs 1. 7 Describe 1 9 Evaluate objectiveg, Infornintion system specift- client, and delivery cations and 13011CM8 I client needs 1. 3 Describe requirements_ ob, ectives P. Costs 1 3. 1 Filial and evaluate I evaluntion geographic 1. 8 Describe Impacts reference geographic needs referencing system 'nventory 2. 2 Describe Fe grp lit, hardware r fe re., Ing requirements .y.t. PT. D_..cr1be 2. 5 Evaluate so tware feasibility CO SrAGE 2 requirements and cost T- 10 2. 4 Describe 2, 1 Describe operating alternative environment Information systems 2. 6 Describe legal Impli- /Z 8 @]Ev-lu@ate cations legal and 2. 7 Describe political political Ini licattJ Im lications L - - - - - - - - - - - _j Vvedbfick Figure 2.1 Geographic Information System Design Methodology 6De crIbe 'a P, Ele gto at "at ns 8 Eval. 271). rib@ @Ie I pgitt.'T 0 p lit I,.I In, Mail. o a Imlictio.. (source:Tomlinson 1976) 1. Problem Formulation 2. Objective Definition F 3. Alternative Generation 9. Generation of New Alternatives 4. Information Collection 5. flodel i nq 6. Benefit-rost Evaluation 7. Questioning Assumptions 8. Re-examination of Objectives Figure 2.2 A Systems Approach (source:Caikins 1972) 69 1. Identify Clients and Client Objectives 2. Develop Information system Objectives 3. Sp!ciify Alternative Objective Sets: Data and Analytical Capabilities Capable of Meetinq Abjective-s 4. Evaluate Object' e Sets ly for Effectiveness S. Has Acceptable Objective 140 Set Been Found? lat; Alternative Systems 2i :Eyluate Alternative Systems FpagiMlitX AM Att F7. 8. Has Acceptable System,@@@ Design Been Foundyl@ 9. Compare Costs Ste with Benefits Rtepp 47@ 10. Are the Expected Eenefits Satisra-M-ry-\ wi th Respect to the Expected Costs? F E M70 Figure 2.3 GENIERAL MODEL FLOW AND FEEDBACK LOOPS (source:Tomlinson 1972) 70 outline of how such feedback can be tied to each iteration of the stages in system research and development. 3A .5 Linking the Design Methodology to Geographic Information System Components The previously introduced tasks which are proposed to be part of the design program can be related to the geographic information system components which were discussed in the second half of chapter I . The relationship of the primary componenit (i.e. data base management, data stora ge and retrieval, etc.) to the primary tasks of system develo pment is presented in Table 2.1. Table 2.2 illustrates a similar relationship for the design methodology to the various components of system support. The value of these displays is that they illustrate which topical areas need to be carried out simultaneously or separate from other aspects of the process. 3A .6 Linking the Design Methodology to a State Resource Data Planning Program Many of the tasks identified as being integral to the proposed system design methodology are also central to a developing statewide data planning effort for which the need is widely recognized and preliminary formats have been proposed. As @he basic inten@ of dat:3 p.'.:jnning is provision of an organized framework for coordinating state agency data related acrivities, the line between data planning and information system planning is for the most part negligible. Data collection planning, data collection, data interpretation, dato analysis . data storage, and finally data distribution have been initially identified as required components cen- tral to the development of both efforts. Data plan ning provides for inventory natural resource information, assessing user needs for various data types, and preparing background reports on the utility of these data types. Data planning is also tied directly to Federal collection ac@lvities, the collection activities of other state and regional agencies and possibly data collected and 71 GEOGRAPHIC INFORMATION SYSTEM COMPONENTS STEPS OF THE Data Base Data Retrieval DESIGN METHODOLOGY Management and Processing System Support Task Identification of data Identification of el ements user and user's needs Determination-of System Objectives Identification of geographic location identifiers Task 2 Identification of available data Assessment of Data Documentation of Availability available data Investigation of geo- graphic referencing -Sys tem Task 3 Survey of available Survey of available Availability of procedures procedures trained personnel Assessment of Existing Survey of available Survey of available Legislative and/or Procedures, Equipment, and equipment equipment administrative basis Manpower Resources for creating an information system Task 4 Data specifications Geographic Detenuination and Evaluation Geographic referencing system of System Specifications referencing system System products - Task 5 Transfer and storage Retrieval, analysis, alternatives and display Determination and Evaluation alternatives. of Technical Operations Alternatives Relationship of the Design Methodology and the Components Figure 2.4 of a Geographical Information System (source: Tsachanz and Kennedy 1975) so MW M on No 00 W* SYSTE14 SUPPORT TOPICS STEPS OF THE Use and Minaqement Procurement' DESIGN METHODOL06Y User Access Requirements Procedures Task I User access arrangemonts Determination of User training System Objectives User needs Task 2 Assessment of Data Availability Task 3 User training Long-range staff Software Procurement plan Hardware Procurement Assessment of Existing Procedures, Equipment, and Institutional Manpower Resources setting Task 4 Formal institutional Documentation arrangements to Product publications Determination and Evaluation accept, evaluate, plan of System Specifications and act on user feedback Task 5 General operating Software Procurement policies Hardware Procurement Determination and Evaluation Long-range fiscal of Technical Operators Alternatives plan Fi 5 gu!" Relationship of the Design Methodology and System Support Topics (source Tsachanz and Kennedy 1975) archived in the private sector. Interpretation and analysis primarily. is designed to provide technical assistance in the use of surficial geology, wildlife resource, water, soils vegetation, and marine resource.data. As well specific analysis such as assessments of development suitability may be undertaken when required,. Data storage, retrieval, and distribution planning will certainly be required to examine the storage of imagery and mapped information, manual vrs. computerized data handling, and alternatives in distributing this information to the user community. Appendix 2 of this report presents the outline for the proposed data planning effort. Table 2.3 presents a summary interpretation of the linkages between the geographic information system design methodology and the proposed data planning program. 74 THE DATA PLANNING PROGRAM DESIGN DATA DATA DATA STORAGE, RETRIEVAL METHODOLOGY PLANNING COLLECTION and USE INTEGRATED LOCATE AMOUNTS, TYPES AND SYSTEM USER DATA SOURCES DEMANDS FOR OBJECTIVES NEED FOR DATA ASSESSMENT G. I.S. .PLANNING TO DATA ACQUISITION MAXIMUM EFFECTIVENESS IN DATA INSURE FOR TERMS OF DEGREE OF AVAILABILITY SYSTEM G. I.S. AUTOMATION, COVERAGE, CON TI N Ul TY RESPONSE TIME, ETC. HARDWARE MAX EFFICIENT DATA CONTINUITY EFFICIENTLY UTILIZE AND UTILIZATION FOR STATE OF THE ART SOFTWARE 'OF COMPUTER HARDWARE SOFTWARE & HARDWARE HARDWARE PLAN TIWETTBLE PLAN FOR BEST MATCH SYSTEM SYSTEM FOR BETWEEN REQUIREMENTS S PECS SCOPE ACQUISITION AND CAPABI LI TI ES & UPRATING SELECTAND LEVEL DESIGN IM PLEM- EN T 0 F PLAN FOR BEST ALTERNATES BASED ON AUTOMATION MATCH EVALUATION I CRITERIA TABLE 2.3 DESIGN METHODOLOGY DATA PLANNING PROGRAM STRUCTURE LINKAGES 75 SECTION IV ELEMENTS OF SPATIAL DATA SYSTal DESIGN 4. 1 Determining System Objectives .06..ofooo 0 00 .00#0*0*00000 .04000*000.0.40 .0-78 I int ro ducti on. . . . 0 * 0 0 0 a 0 0 * 0 . 40@0 o - o o oo9 - o o * 9oo w 9 oo o oa o * 78 2 Identifying Users and User 3 Identifying Data and Data Handlin g Requirementsto.*&**.*****.*oooo**#*79 4 Determining Geographic Identifiers ...... .0 ..... assoo.09to.*6*0080 5 Identification of System Usersooooe#ooooooo 6 User Issues to be Addressedoes-ooeog.o.oseeome@4*00060.*Ot.oeooooo.***84 7 The User I The Survey Results ..... Dept. of Inland Fisheries & Wildlife .......... Dept. of Transportation-**.* Dept, of Conservationoe*oooooa,tooooooooooo*ooooote*eoo**ooooooo**@987 Bureau of Forestry ......... Maine Geological Surveyoo*****oooo*ooooooootooo**Oo*o0069000004089 Bureau of Parks and 00 Land Use Regulation Commissionoooo****ooo*oo*oooooooe***eoeoooo#90 Dept. of Environmental Protectionoeeeooo*ooo*oooo*oo.o..ooooeo*ot**gI Office of Energy Resources***&**# ... State Planning Officeoo..oooosooooooooooooo***o*oo#*****ooooooooooo93 2 Composite User Needs.ooooosooe 3 Summary of User Needs Assessment .....o....... 0 .... so*.*.* ... 0040#.0100 4 Required Future Activities.o..Oooo#oooeo***oeo*oooeo**ooo*oeeo#o*o*100 Assessment of Data Availability ... a 0 0 0 V & 0 * 0 0 0 0 0 0 6 0 6 6 0 * 0 9 6 9 & a * 0 0 0 0.I W I Introduction**oooo***oooo..o***ooao.ooo**..** ... ***ooo.ooo.oo*ooooe9o*101 2 Role of a State Data Plan in the GIS Availability Assessmento..0600064101 3 Data Identification&oooeoe*o**o**o ...... 00.....0 ............. ........ *.102 76 4.2.3.1 Federal Data Sources 103 Indexes of Natural Resource Information..................103 Federal Agency Data Bases and Data Systems...... . . 107 Dept. of Agriculture . . . . . . . . . . . . . . . . . . . 108 Dept. of Commerce..........113 13 Dept. of Defence .........119 Dept. of Housing and Urban Development................................121 Dept. of Interior..............122 Dept. of Energy.......................................130 Environmental Protection Agency........................................l3l .2 Identification of State Data Sources ................132 4.2.4 Data Documentation........................................133 I Documenting Existing Non-automated Spatial Data..... ...133 2 Documentation for Automated Spatial Data Files................135 4.2. Spatial Referencing.... ... .... 136 4.2.6 Data Availability Summary................137 77 4.! DETERMINING SYSTEM OBJECTIVES 4.1 ;1 Introduction Determining the objectives to be met by the system is always the first major task in system design, yet it is often the most difficult part of the design process. Resource planning and management, because of its strong interface with political decision making, often does not lend itself to clear, concise statements of objectives. Despite this, difficulty the process must be initiated with an attempt to determine objectives by specifying the various data requirements and products applicable to the perceived resource management problems of potential system users. The process must be iterative to the point that it should be carried out during and after a system has'been implemented to allow performance evaluation to determine if changes are required or desired to increase the utility of the system. With such'a model being used, the process is applicable to initial system design and subsequent modification as the system should never be viewed as a static tool but as one which must be able to*be altered quickly to respond to changing land use and resource planning demands. It is proposed that the objec tives be determined by defining: (1) the system objectives for each potential user; (2) the data items and analysis capabilities which should be included in a system; and (3) using the components of I and 2 in an integrated manner for a total system evaluation framework. 4.1.2 Identi fying Users and User Needs This first step deals with: (1) identifying the decision makers which will use the system; (2) identifying the line agency users of the system; (3) identifying user objectives for using natural resource data; and (4) estimating what the impacts of using the system will be on user decision 78 making. Tomlinson et al. (1970) point to using decision matrices (Fig. 3.1) as a method for identifying proposed system users to describe their roles, actions, and information requirements. Using this tool through a number of iterations it should be possible to define classes of potential users. For example: prima ry users may be identified because their aggregate need will essentially specify the scope and type of system implemented; secondary users will have most of their needs met by the system but not by virtue of these needs forming actual system @Specifications; tertiary users are all other potential users whose needs are implicitly met (thatis, the s ystem. is valuable as.a tool but tertiary user needs did not weight heavily in the design of the system). The.matrix is aimed at assessing user objectives by identifying the methods available to each of them for-implementing planning and management,decisions. This determination is valuable because information need is based directly on these actions and examination of potential implementation tools allows an approximate assessment of the impact of having or not having various information types or an information system. Also information that is not directly linked to some user activity may be of interest but it is of little utility and therefor should be of low priority for inclusion into any system data base. 4.1 .3 Identifying Data and Data Hanlding Requirements This task is important for developing data requirements primarily in terms -of.content. Data which is to be identifie d must be directly related to user objectives which have been defined. The end product of this assessment should be descr ibed in sufficent detail to allow the preparationof relatively simple system specifications. Parameters such as scale, classification schemes, currency and accuracy must be Adopted with standards adopted. Figure 3.2 presents an idealized diagram of the elements which should be a part of 79 this determination. This is amodified tree diagram which is utilized to summarize the relationships between user objectives, information requirements, data content, and data specifications. It is also potentially valuable to prioritize information as economic considerations will surely limit the amount of data that can be included in the system. Weighting techniques can be applied for prioritizing information. Even count ing the number of potential users requiring the same general data and information types places relative improtance on different data and information types. 4.1 .4 Determining Geographic Identi fiers Manual systems rely on maps on which locations of various data types are preserved and directly observable. However, if data are to be encoded for computer processing, directly observable information location is lost. Location identifiers are data items which specify relative or absolute locations (i.e., x y grids, latitude and longitude) which must be acquired, stored and manipulated. The.process of encoding spatial data for computer processing thus incurrs at least on additional data item for each data set and the loss of easy to observe visual data displays. This process as previously stated begins long before actual system design commences and is continued for the entire life of the system. A number of important parameters should be accurately estimated with acceptable accuracy before system specifications are finalized and the design process begins. It is the four following determinations which should most incluence user needs inputs into specifying the elements of system scope and structure: 1. For all users current and potential data needs, use, and handling requirements should be identified; 2. Potential future data needs should be examined; 3. Potential future data handling problems should at the very least be identified; 80 4. Desired future data items and data handling capacilities for all system users should be identified. L1.1.5 Identification of System Users Information system users will come from a number of operating environments: planning, management, research, and operations. Geographic information systems are generally useful to all functions except operations functions which rarely require spatially referenced information. Planning and management agencies and organizations are usually the primary users with research generally secondary or tertiary in terms.of priorities of need and accessibility. Thus, in looking for potential geographic information system users those persons and.agencies engaged in environmental planning and management will form the nucleus of the user class for determining initial information requirements. The first iteration of the design process, which will be described in a later section, will look chiefly at state agencies making up the Land and Water Resources Council. This group was selected.for a number of reasons. First, it is the Geographic Information System Coordinating Committee which initiated this study and therefor can be identified as a group of primary users. Secondly, a number of other potential users (i.e., local communities) may gain access to the system and be involved in actual system design through state departments such as the Planning Office. Thirdly, given the time constraints on this project identifying the complete potential user community would be prohibitive. With this in mind the agency users to be identified and examined in terms of looking at data and data handling needs and requirements are: Department of Energy Resources Department of Environmental Protection Department of Inland Fisheries and Wildlife Department.of Marine Resources Department of Transportation Department of Conservation State Plannina Office 81 Within each major office the needs of individual programs and divisions will be examined separately. For example L.U.R.C., Forestry, Geology, and Public Lands all of which are in the Department of Conservation will be examined separately. There have been very few attempts to systematically assess needs for spatially referenced data in Mai ne. Efforts in the past have been associated with the Maine Information Display'and Analysis System Users Group and with development of the Minor Civil Division Data Base within the State Planning Office. The MIDAS users group (MUG) operating from the early to mid 1970's. With the Technical Services Division assuming responsibility for maintenance of MIDAS in 1972 this group was made up of a number of agencies who at the time were using MIDAS computer data files. This group regularly reviewed data availability, data needs, new data@files and new data analysis techniques. The actual focus of much of the system was socio-economic natural resource civil division referenced data which could actually be retrieved for a number of spatial referencing requirements. The Departments of Inland Fisheries and Wildlife and Marine Resources did maintain resource data but this data generally related to permits and licenses and was stored and retrieved in formats generally unsuitable for mapping and direct spatial referencing. User assessments were generally informal without specifi t forms and questionnaires but reliance was placed on the regularity of the user meetings. A thorough review of MIDAS related activity for the period of 1970-1975 revealed little information on determining user needs and objectives which is of direct utility to this study because MADAS was a statistical data management tool and this study is addressing geographic data systems. In 1976 the Technical Services Division of the State Planning Office began 82 to work on a revised socio-economic non-spatial data base. By this time MID AS coordination was absent and, data fil e maintenance and use was extremely splintered. Technical Services, because of expertise and time and manpower constraints began to focus on socio-economic profile data. This data base is being developed specifically for internal SPO use and to maintain the information. as a service for other data users. The data base includes elements most frequently requested within State Planning.and by other state agencies. Design was started with additional iterations undertakne to aid in shaping thesystems growth and expansion. User needs of other state agencies was not the single overriding factor but were important. It is also important to remember that the data base is being put together primarily to satisfy. internal requirements for socio-economic information. No attempt is being made to address spatially referenced natural resource information because of the potentially large data amounts and the complexity of the data handling and analysis software which are involved with such information are beyond current capabilities of Technical Services-within the State Planning Office. In summary, because this data base is being developed to provide growth, social, economic, housing and expenditure profiles it is viewed as not being applicable to developing objectives for a natural resource spatially referenced data system. This apparent lack of user needs assessments for natural resource geographically referenced data has led the Natural Resource Division of the State Planning Office to initiate an extensive user needs study for natural resource information. This effort (summer of 1977) is part of the Statewide Natural Resource Data Collection Plan and it will be available to serve as part of the foundation for both a state data,collection planning effort and continuing Geographic Information System studies. 83 4*1 .6 User Issues to be Addressed Based on figure 3.1 the following elements were identified as being important to an initial data related surveying effort: 1. Statements of agency missions and mandates. 2. Data types currently utilized by each agency to satisfy current planning and management needs. 3. Data which is collected and archived by agencies primarily for use by other data users. 4. An assessment of current'and projected data manipulation capabilities and problems. 5. The need and methods to link statistical, spatial, and cadastral data systems. 6. The implications of 1 ... 5 on geographic information system design and development planning.. Constant updating of these parameters should continue through system implementation getting more specific in each succeeding survey to be able to accurately estimate: 1. Updated data and information requirements for system users. 2. Current data analysis techniques 3. Required data needs whichare not being met at present. 4. Realistically useful data types and analysis capabilities. 4.1.7 The User Survey The material to be presented in this section was derived from a user needs questionnaire, a follow-up telephone interview and a personal interview. Material is viewed as being preliminary as@this is only the first iteration of a multiple step process which must be undertaken to fill in the matrices which were presented at the beginning of this chapter.: (A copy of the questionnaire appears in Appendix 3 of this report.) 84 This section is intended to only yield a brief overview of spatial data handling activities of the various members of Maine's state resource planning and management system. The mission statements which are presented give generai overviews of the fundamental aims and directions of data related activities. There are also concurrently data related objectives which are not directly formulated by those framing the various agency missions. The spatial data handling.required to support the activities of these agencies can be described in terms of four broad categories: I . Data gathering and dissemination; 2. Data production (i.e., map cartography); 3. Scientific research; and 4. Land resource evaluation and clossiFication and administration of permits, leases and regulations@ Within these categories there are a wide variety of spatial data applications. Generally data amounts are relativly large and data handling is sufficiently complex to warrant the con- sideration of explicit state-wide policies directed toward the solution of data associated problems which can be identified. The volumes of data which are needed and the complexity of data analysis procedures are aspects which greatly influence costs associated with data use. 4.F.7-1 The Survey Results As resources planners at the state level are engaged in diverse management, enforcement, protection and identification activities (Wright 1977) a composite view of their data needs must reflect this diversity. The key to a successful data needs survey and subsequently a complete data planning program is the identification of levels of data need and use commonality. The 85 range will extend from agency specific data (i.e. deir fertility rates) to relatively general data for planning (i.e. land cover types). Inland Fish and Wildlife Inland Fish.and Wildlife currently, relies heavily on. internal computer data processing capabilities for data use and storage. Much of this effort is associated with updating their data bases for individual species plans, liscensing, and regional modelling for wildlife planning. Approximately 20 Midas data@ files are maintained for these -efforts using standard geographic referencing and data types C6 ding. The agency's data handling system@ is well established and needs appear to be well prioritied. Personel were able to estimate two important data related trends (amount and level of complexity in data use for problem solving) relative to present and projected activities. AMOUNT COMPLEXITY OF OF DA TA DATA 1977 198 1977 -77992 Aerial photography and various ma p data types are widely utilized. Major data needs are detailed land use and cover types ma pped and statistically summarized for each minor civil division and organized territory. In summary thi's agency is handling current data types well, and appear to be well situated for handling future data as it becomes available or is collected.1 Specific data handling tools such as computer graphics could be well utilized because of current agency reliance on an automated data base. Department of Transportation the Department's Bureau of Planning maintains a continuing review process 'or all 186 transportation related activities with respect to environmental impact assessment. It also formulates environmental policy and administers the scenic highways program. Primary data files are the statewide system of highway maps, various vehicle re- lated parameters, road inventory data, accident records, speed data, and commodity movement data. The Planning Bureau relies on general data such as forest types, topo- graphy and slope land use change, housing population, and wetland maps as basic input for their planning activities. The Bureau also does utilizes automated data files and digitizer recorded data for a portion of-their total detailed data base. In summary Bureau personnel felt that most of the data they require is available and being utilized, however, personnel did recognize that significant improvements can still be made in their techn iques for data storage, and retrieval. Evidence such as increased digitizer encoded information attests to their attempts at streamlining data activities. It certainly appears that a number of the previously discussed general data types (i.e., soils) could be held in a central automated data bank and completely satisfy current and projected Bureau needs and be equally available to other organizations with similar data needs (see Wright 1977). Bureau personnel did not feel they had a sound basis for projecting data and data handling needs for 1977-1982 so a projection i's not included here., Such projections are therefore omitted from the remainder of this report because this specifically appears pre- mature. Department of Conservation Bureau of Public Lands The Bureau manages public reserved lands under principles if multiple use, assembles land in larger tracts@, inventories the naural assets of, public lands and does agricultural and 87 timelier management planning for these areas. Primary data files and data types required for planning are: The Coastal Islands Registry; The Public Lands Resource Inventory; and general data types such as forest stands, agriculture, wetlands, soil types, topography and slope, parcel boundaries, wildlife resources and lake water data. Most pressing needed data are detailed timber and soils mapping. Where appliable the.Bureau does utilize standard MIDAS and state codes for all data records. The Bureau relies heavily on data which could be efficiently retrieved from a common data base and aggregated for detailed analysis using standard geographic information system capabilities such as overlay, area, search and regression analysis. As mentioned, Bureau personnel felt unprepared to accurately portray projected data volumes and data handling requirements for 1977-1982. But they did generally feel both data amounts and the complexity of requ red analysis would increase sharply. Bureau of Forestry The Bureau carries out timber improvement studies and woodlot reconaissance, provides technical assistance for L. U. R. C. permitting and shore land zoning, and provides assistance to timber operators and processors. There are a number of specific task related divisions within the Bureau. Primary data files are the Forest Insect and Disease Survey; Fire Danger Records, Fire Reports, stumpage sales, lists of logging firms and processing reports. General data types used are topography and slope, vegetation and soils. Currently the bulk of forest related map data and statistics come from the USDA Forest Services Forest Inventory which are conducted at 10 year intervals. Major data needs are detailed forest cover typing, 88 forest suitability data and parcel ownership information. The Bureau has unsuccessfully partially relied upon MIDAS files for data storage and retrieval. System problems interfered with the data reporting and analysis requirements. Bureau personnel.felt they could in the future adequately plan with a statewide data base of forest types, soils, parcel boundaries, and other general data types. Such a data base could be readily stored in and retrieved from a larger statewide geographic information system, and formatted and aggregated for specific bureau purposes. More specific data types could be accomodated in such a system as well or housed managed, and analyzed inhouse by the Bureau. Personnel felt they had no solid basis on which to make data and data handling needs projecti ons. Bureau of Geology The Bureau maps, interprets, and. publishes geologic information and provides interpretive information for planning and regulatory agencies; studies are being conducted on surficial deposits such as scnd and gravel, groundwater conditions, beach erosion p roblems and dune management plans. It works in wetland protection permitting, evaluating dredging permits and is involved in a study for nuclear plant and industrial facility siting. Primary data files include surficial and bedrock geologic mapping, groundwater, and well data. Bureau personnel detailed the need for data analysis such as@contouring from point data, and statistical analysis, and histograms construction. As well the need for retrieving and for- matting data for activities such as landfill application review is also a ma,jor area of concern. The Bureau is involved in the development of surficial geolgic and other data types which are essential to land planning and management activi ties in Maine. A number of data types used by the Bureau are relevant only to the bureau's activities but generally the agency does and will play an essential role in the development of a central statewid e general planning 89 data base, because of its role in the development of most geotechnical data for the state. Bureau of Parks and Recreation Parks and Recreation purchases land, and designs and develops recreation facilities and boating access sites. It prepares the Maine Comprehensive Outdoor Recreation Plan, conducts scenic rivers studies and provides technical assistance for local recreation programs. Primary data bases are Private Commercial Recreation Areas, Quasi-Public Conservation Areas, State and Federal Recreation Areas and Municipal Recreation Areas. Primary data related activities relate to long range plal nning and facility design. Other information which :the Bureau requires are forest types, land cover, wetlands, soil types, landform.topography and slope,, population, utility locations, fish and wildlife resources, lakes, watersheds marine resources, and floodplains. Data needs appear to stem from the@ lack of personnel for extensive raw data interpretation (i.e., from aerial photographs) rather than from the complete absence of raw data Land Use Regulation Commission LURC is responsible for collecting and maintaining data needed to regulate and guide land use activities; protect natural resources; and plan the future of the unorganized territories and plantations. With respect to applications review :the data is primarily gathered and provided by the applicant. Existing conditions discriptions and discussions of impacts are required. LURC generally must augment this information with land use, social historic and regional natural resource data. Two efforts, planning to gain an overview picture and establish basic infor- mation foundations, and development review which generates a significant amount of data are proceeding together'. Most heavily used data types include wetlands, soils, topography 90 aild slope, land use, wildlife resources,..wa te rsheds, lakes and floodplains. LURC p!anners expect the amount and typesof data required to increase 'n the future. Ava i labi I i ty of new data types such as LUDA cover mappi ng wi I I change information rella fed patterns. Primary data needs are detailed cover typing, complete soils coverage, accurate wetlands maps and periodic high altitude photography. Again, LURC planners have identified data types. Often associated with geographrc information systems as being primary in terms of use and need as high priority.dato types. Data analysis and interpreta tion activities are projected to increase in complexity and extent in the near future along with the amounts and specificity of this required data base. Department of Environmental Protection Bureau of Land Quality Control This Bureau controls all significant development activity which may have an adverse impact on the environment; regulates location of solid waste disposal sites through the Municipal Waste Disposal Act; has started an extensive program of groundwater monitoring around existing dumps; regulates septage sludge and water treatment plant sludge disposal and protects Maine waters against discharge with emphasis on potato wastes and pesticides; conducts programs aimed at reclamation of mined areas and administers the Site Location Act. Primary data. types utilized are: land cover for watershed work; agricultural non-point sources; soil types and landforms; groundwaters, and topographic and slope analysis mapping. Also land use, urban housing, population, industrial locations, fish and wildlife resources and watersheds and floodplains are used. With few exceptions their current data required for plan- ning is general resource data aggregated for specific purposes according to the analysis under- taken (basin nutrient budget estimations). 91 Data needs stem from both manpower shortages and from available data which must be organized or aggregated to satisfy specific analysis requirements. Good runoff and climate data, flow and discharge and water quality data are identified as primary data needs. Water quality data enables agency sampling so expansion wil I be determined by future bureau capabilities. These agency specific data problems at least initially must be viewed as being beyond the realm of a statewide data planning effort. Office of Energy Resources Energy Resources prepares the Comprehensive Energy Plan for the State; developes energy Policy; assists individuals and organizations in drafting proposals for funding by Federal research and development agencies; and maintains a well stocked energy related library. Current data related activities focus on creating an energy needs data base consisting of needs; storage locations, surplus and transport data for the state. The agency is also involved in theNew England Energy Management System which may even@ually provide a vehicle for storing and analyzing spatial and statistical data relevant to envery resource plan-, ning. Specific data trends are difficult to project because much of the Departments work is response to specific problems and conditions which may change over short periods of time. In general data types related to population, land use, and transportation will continually be used and specific data types such as fuel oil supplier locations will be activated and deactivated as the@r need arises and falls. Although incomplete, GIS related general data is valuable to the Depart- ments planning programs and goals when utilized with requisite energy related data parameters which may be too specific for inclusion into any general statewide data base. But, this data should be available from Energy Resource i to parties through designed data networks determined by a larger statewide data planning program. 92 STATE PLANNING OFFICE H. U. D. Land Use Element Introduction This program is designed to conduct policy related studies developing recommendations. concerning improvement of Maine's planning and managment framework and complete program applications forH.U.D, Current work elements, include state resource use and constraints analysis conducting policy related studies and describing Maine's resource managment system. Current Data Use and Requirements Because the program is state overview oriented,in the post there has been little compilation of data, instead the focus has been on various policy related activities. However, the program is currently involved in the development of a "State Overview Opportunities and Constraints Analysis. This effort involves the preparation of 1:500, 000 scale maps, tabular statistics,and supportive text for a number of natural resource and socio-economic land use parameters.The result will be a binder type publication for easy periodic updating of the maps and tabular information types. Specifically, the report will include 20-25 basic inventory maps,initial resource and socio-cultural trend map and general capability mapping and analysis.Mylar bases will be archived for all of this information to allow the distribution of print copies to all interested parties. A second thrust,which is still in infant stages of development, is a geographic specific resource allocation mapping and planning project. It is projected that this effort will involved SPO,LURC, and the New England River Basins Commission.Again a number of natural resource and land use associated parameters will be mapped and analyzed at scales of 1:100,000 to 1:250,000.Products will include basic single factor maps and a number of composite analysis maps such as special resource areas and locations with high development potentials- Implementation of study Findings will be unbinding on regional and local actions but should help shape federal state resource oriented activities. 92-a A third data group which will likely be a part of this program will be the LUDA Program ouputs available from U.S.G.S . These 1:250,000 scale maps will include information on land use and cover types, watersheds, census tracts and public land ownership. Computer compatible magnetic tapes for all this information will also be available. it is projected that this data will be archived for both in house uses and for public distribution to other state and regional agencies as well as private data to users. Data Handling Requirements The Overview Opportunities and Constraints Analysis will involved archiving the base materials for composits analysis work and for distribution. Eventual automation of data base is desired for extending analysis and cartographic capabilities.Whether automated or manual this will be the first such overview data centralization for Maine and as such data handling is. difficult to project.Five year projections for data handling are as follows The larger scale geographic region studies will involved extensive amounts of map data and associated tabular information.For this effort, due to its current initial stage of development it is also difficult to estimated data handling requirements. Pro cted data handling requirements for this effort are as follows. D.A. associated data handling is not viewed as a complex data-handling p'jr*Uob'lem unless there is significant demand for use of the digital data, which is doubtful. Evafuciion of Data Needs and Requirements The HUD Land Use Element will be involved in compiling, analyzing and distributing a signiFicant amount of state overview natural resource data.As data amounts increase data analysis activity will also be rapidly increasing so by 1982 a substantial data base and data handling problem will have been established. The data needs questionaire and interview seem to present a clear understanding of the future direction of the program in terms of data amounts and handling requirements @being recognized by those in the program. Eventual automation of at least parts of the data base does also seem likely to increase data analysis capabilities. 92-c State Planning Office Coastal Program The coastal program provides information for private and governmental decision.makers so informed land use decisions can be made. It also provides educational and technical assistance in coastal matters. Major program thrusts are: technical and financial assistance; developing recommendations on major coastal issues; data collection for resource planning; and the publication of resource maps and planning handbooks. The Coastal program has been operating under 305 CZM funding. As such its major role has been one of data provision for planning. A Coastal Scenic Inventor y, Coastal Aflas, Coastal Socco-Economic Inventory,, Coastal Lakes, Recreation Facilities and.Climatologic Reports are the primary data bases. The program will soon be operating under a 306 CZM, funding mandate. As such the primary thrust wi I I become the provision of funds for local initiatives and coordination of local planning activities. Primary activities are associated with data compilation and distribution. The Coastal Atlas is currently in demand having to meet up to fifty data requests a week. As well, a number of publications are widely requested and present a moderate handling problem. The Coastal Scenic Areas Inventory is essentially nonfunctioning at present. The program must also handle a number of random requests for nonprogram generated data which is archived by the program. The building of complete indexes for all archived maps and aerial photographs and, compilation of the socio economic activity are also important current activites. It is felt that' information handling requirements are currently peake and will begin to taper off. The 306 program will require no or little generation of new data. A part of the grant does go to statewide coastal planning issue studies but much of this work will be done within other 93 state line agencies. Previously generated data will be the primary data base. Data handling activities are currently peak requiring full time intern assistance. Should no major changes *ccur data handling problems associated with coastal program information will fall to line agencies and regional and local planning and management activities. State Planning Office Critical Areas Program The 'Critical Areas Program consists of two phases: Area registration and conservation implementation. The primary objective is to inventory and compare the value of significant natuai features of the state and decide which are really of state wide significance. Major work elements. include evaluation and registration, report preparation, maintenance of a registry, a data storage file, and provide assist once to data users. Most data handling currently relates to the area registration process. Once subjects to be inventigated have been established priorities are formulated on which areas will be looked at. Planning reports which present what types of areas should.be included are prepared and then recommendations forspecific areas are made. The Critical Areas Board reviews the nominations. Data handling activities relate to the publication and distribution of planning reports and information on specific areas. Data currently exists on 62 registered areas; 15 areas currently being registered and 150 areas which should be registered by the end of the year. It is pro- jected that 100-200 areas wi I I be registered a year. A second data,oriented activity deals with areas which are being studied for application of appropriate. conservation measures. This activity is infant and data needs are unknown. As well the areas are archived on 71 minute quadrangle maps and in descriptive computer files. 94 Data handling consists of collection, archiving and evaluating data on proposed or nominated. areas. With current staffing and funding, given projected registration, data handling should not be a major problem. Data handling trends show registration should continue at about current rates. As the program moves into implementation greater data analysis and distribution may be required for areas for which conservation, implementation is proposed. As wel I the tota I amount of data archived lby 1982 will be significant as 600-900 areas will have been regis- tered. Data needs arise with specific area nominations. Primary data collection by the program is not significant, as consultants and persons nominating areas often provide basic data. The quadrangle maps for area mapping are the only map type requiring state wide coverage. Water Resources Prog m This program is involved'.1n comprehensive water and related land resource planning. Primary objectives are intensive water supply evaluations, data acquisition efforts, flood damage reduction planning, future water use projections, working towards water quality goals, and itoring state regulations in regard to water planning. Current work elements include water moni supply assessments, data acquisition for river basins, critical areas work, specific pollution studies and watershed measurement projects. Many of the activities of the program are not specifically data related. However, the program has created and maintained a number of water related data files. These have 'ncluded @Ie Lake -Name Index (MIDAS 906Z); the Dams Inventory (MIDAS 543W).; the River Names Index (MIDAS 905Z). Because of lack of technical support these automated files have been turned over to Inland Fis and Game. In addition a Listing of Great Ponds; A Lakes Studies Inventory, and A Map Overlay System (U86S 1-250,000) have been de-veloped @hroug@ @he 95 the program. Much data currently used I's generated byline agencies such,as Inland Fish and Garre and Environmental Protection as the programming assumes a planning-coord" notion ro!e rother than one as a primary data collector. The handling of automated files had proven &fficult. so files were turned over to another agency. The program is stil I ceni ral in "he rnap overlay system for the state. A specific data handling need which has been identified @s the inability to access and retrieve existing automated information by river basin for basin planning studies, as an effort to establish basin by basin data needs is also underway. The indent program does not have actual projectionsfor levels of data collection, handling, and analysis for the 1977-1982 -kime frame. Note: Department of Environmental Protection, 'Bureaus of Water and Air Quality, Depart- ment of Agriculture and Department of Marine Resources are not included because interviews could not be scheduled while project was underway but w i I I be Included in all future interations. 4.1.7.2 Composit User Needs Wright's (I 977),survey of data use and needs for Maine yie!ded soi Is, watershed and land use/cover to be critical data elements for statewide planning. The Data Priorities survey and this work reveal that the most universal problem facing state resource planners and managers stem from shortages of funding and personnel. Because only agency specific goals current ly/p!a nne rs, guide planners procure their own information often resulting in essentially redundant data collection. (exerpted from Wright 1977) A 96 low as :M 4z Table 3.2 Data Priorities Based on Data Use and Needs. Nlag,' Planning ,@ Iles Use For - lev M,.-APPING SCALE Report Resource Category 1/1200 1/2400 1/6000 1/12000 1/20000 1/24000 1/50000 1/62500 1/125000 1/500, 000, Point Tabular Land Cover (gen.) x x Forest Stand Types x x x x x Other Vegetation x Agriculture x x x x x x Wetland (salt) x x x x x x Wetland (fresh) x x x x x x x Soil types x x x x x Bedrock Landforms x x x x x Glacial Land Forms x x x x x Ground Water Locations x x x Topography & Slope x x x x x x x x x Land Use x x x x x x Land Use Changes x x x x Urban Housing x Population Densities x Industrial Locations x x x -Parcel- Boundari-es- - x x x Utility Locations -x - -x Transportation Facil. x x x x x Recreation x x x x Fish &Wildlife Resources x Critical Areas x x x Marine Resources x Watersheds x x x x x Lakes x x x Flood Plains x x x x Source: Wright 1977 Table 3.1 Resource Information Priorities Resource Category State Planning Land Cover 4 Land.Use Forest Stands 2 Agriculture Wetlands' - Fresh Soil Types 3 Bedrock Landforms Glacial Landforms Topo and Slope 2 Land Use Change Housing Population Density 1 Recreation Fish and Wildlife Critical Areas 3 Marine Resources Watersheds & Water 5 Resources 98 Table 3.1 presents Resource Information Priorities base on commonalities of response and need. Table 3.2 presents the scale differences which were noted for these data types. (A system centralized common data typeSrwould therefore be required to meet differing requirements of scale and aggregation). 99 4.1-.7-.3 Summary of User Needs Assessment It is gener'ally accepted that geographic information systems should have user needs well defined and these needs should be primary defermina nts in system coverage and develop- ments (Tomlinson, et. al. 1976). Although generally accepted, there are no easy methods to acquire information about user demand. In the past it has been often only assumed that large scale geographic systems sponsored by public agencies are designed to serve a broad range of users, and its assumed that the system products will be automatically desired and used. Tomlinson et. al. point outthat user demand surveys are difficult to conduct and almost impossible to interpret. The principle constraint is that user needs investigations require responses concerning proposed products rather t@an real products. In the future various .methods of determining user needs must be used (such as interviews and questionairs). As well@price initiated a system program must contain an extensive user interraction program to explain data products to potential users, to provide users with prototype products and to receive from users an assessment of the utility of data products. 4.107*4 Required future activities Three efforts in. regard to user need assessments should become regular parts of activities associated.with data and information system planning. These are: I ) continuing use of interviews and questionaires in user ne@ds determinations 2) assessment of the need to directly link natural resource, socioeconomic and caclastral information. 3) The provision of example system outputs. 100 4.2 Assessment of Data Availability 4.2 .1 Introduction Some of the most critical decisions f 6 be faced in developing spatial data information systems concern methods of data acquisition. For example, ifthere are no constraints on time or cost, most system developers would choose to collect th, eir own primary data. However, numerous constraints generally require agencies, at least in some part, to rely on other sources for much of their data. Significant among these constraints are dollar costs of basic data collection and the fact that similar data are collected and maintained already by other agencies. Use of secondary data has immediate'appeal because of reduced data associated costs, but- the requirements for data handling and updating over a long time may create large initially unseen costs. Generally though, information system development is feasible only with the extensive use of secondary data sources which may be supplemented with primary data only when necessary. Therefor, any strategy for implementing a geographical information system must include an assessment of existing data sources (federal, state, local, and private). Such assessments must be long term, systematic, inerative and thoroughly documented, becay se the continued existence of a useable system will depend on the quality and reliability of the sources of data. Three specific subparts of data planning need to be undertaken to satisfy this need: identification of available data; documentation of available data; and investigation of spatial referencing systems. 4.2 .2 Role of a State Data Plan in the GIS Data Availability Assessment As previously introduced in Chapter 3, the proposed state data collection planning activity and the continuing geographic information system study should proceed on a arallel course. i p This duality in developing the two systems should allow development of linking mechanisms for the three basic components of data planning - user needs assessment, data availability assessment, and provision for storage, retrieval, and distribution of information. As well, it is proposed that this data related effort could also involve providing planning reports on specific data types such as capability 101 and suitability analysis Utilizing GIS associated capabilities. A major part of the data assessment effort will focus on inventorying resource information for Maine. A second major thrust derived from assessing user needs in terms of data handling will be the establishment of standards for new data collection.. Scales, level of detail., accuracy, coding and formats will all be examined to meet the requirements of the greatest number of potential data users. Federal collection activity will be monitored. The state would submit statements a f priorities to the various federal collection agencies to provide maximum consistency with user needs and geographic infor mation system requirements. Formal liaison would be established between the Maine Land and Water Resources Council and,the various federal collection programs. All state and regional agencies proposed data collection activities would be reviewed. Agencies would.not be required to be consistent with the plan but recommendations as to where it appeared that consistency could be achieved without extra cost would be integral in this effort. 4.2 .3 Data Identification Introduction This section presents a general review of the.data available for state resource planning in Maine. Later sections include discussions of issues rel ated to data acquisition, and characteristics of data that affect both their handling and use. Emphasis is on implications for data handling rather than on specifics of data use. Reso urc e management and planning have environmental, natural resource, social and economic dimensions. Diverse types and considerable quantities of information should be considered in order to review the current status, past trends,. and possible future directions in these areas. Timeliness and availability of information are the key considerations since decisions are often made on the basis 102 of information which is available, without waiting for collection of additional data that would be thought relevant. Although numerous data exist, they are scattered through the files, libraries and information systems of federal, state, and local public and private organizations. The data are often difficult to find and often exist in formats which are not of direct utility. Availability of data is limited both by the extent of federal and state data programs and by the capabilities of current inforTnation systems. Related to thist states are facing massive problems in developing data bases for resource planning. Data series on land use, soils, an ;d geology are not complete for Maine as is the case for much of the country. In summary, data generally are not available for the majority of tasks for resource management, and it is both appropriate and necessary to identify and review information sources, distribution programs and efforts aimed at data indexing. This effort is initiated as part of this work not to serve as a compilation of data sources, but to identify the agencies, programs and individuals who are responsible for decisions concerning the availability of natural resource data. 4.2.3.1. Federal Data Sources 4.2 .3.1.1 Indexes of Natural Resource Information There is no catalog or directory of al I the data sources and series of either the federal government or any state government. There have been movements toward such compilations at various levels of government but these are incomplete and generally stop short of listing data series item by item. However, two types of efforts.are of interest to resource planners which will be discussed here in detail. These are: efforts to compile directions of data collected by federal agencies;and the establishment of the National Cartographic Information Center. Several federal departments have und,ertaken compilation directories to thedata and information 103 systems within their organizations. These are identified to establish contacts With the indexing efforts which will most likely play important roles in the evolution of a state data collection plan and allied information system efforts. The feasibility of establishing formal ties to important data indexing efforts will be examined as part of a continuing effort. Environmental Data Index (ENDEX): ENDEX contains computer-searchable descriptions of inter- disciplinary files of environmental data on many levels. Approximately eight large environmental data files may be searched through the ENDEX System. When these files are large, detailed inventories are also provided. Specifically, ENDEX has three major components: J) descriptions of data collection efforts; .(2) descriptions of data files; and (3) detailed inventories of large, commonly used files. An ENDEX data file description lists the types and volumes of parameters available, the methods used to measure them, when and where the data were collected, the sensors and platforms used, data formats, restrictions on data availability, publications in which the data may be found, whom to contact for further information,, and the estimated cost of obtaining the data. Individual ENDEX data files des criptions will be updated every 2 years. ENDEX services and products include: (I ) access to specialized indexes of environmental data, grouped by geographic areas, institutions, or disciplines; (2) on-line, interactive searches of the indexes to answer specific questions concerning the availability and whereabouts of data files; (3) a quick-response determination of the costs of retrieval from large data files; and (4) data catalogs from large NOAA environmen fa I data collection projects. Oceanic and Atmospheric Scientific Information System (OASIS): OASIS is a computerized information retrieval service that provides ready reference to the technical literature and to research environmental sciences and marine and coastal resources.. It provides computerized searches of both NOAA abd non-NOAA data bases containing references to technical publications. Approxi- mately 33 major environmental data bases may be searched through the OASIS system. OASIS offers 104 access to major meteorological and oceanic bibliographic information riles not available anywhere else in computer-searchable form. USDA Data Inventory. Office of Information Systems, USDA. 1973-1974. 7 vols. The USDA Data Inventory provides a concise listing and description of the data files used or produced by agencies within the Department. Each of the First six volumes contains the program data requirements for one or two of the missi ons of the Department. (A mission is a grouping of Department-wide goals that characterize the Department s role in solving broad, national problems. The 10 missions are comprehensive and include all activities of USDA.) These volumes are: (1) Agri- cultural Exports. Foreign Agricultural Development; (2) Rural Development. Encironmental Improvement and Resour ce Development and@ Use; (3) Support for Non-Federal Governments and Institutions. General Administration and Program Support; (4) Food and Nutrition. Consumer Services and Human Resource Development; (5) Agricultural Production and Marketing Efficiency-,- and (6) Farm Income. 'The seventh volume is a combined subject index which lists all subject terms contained in the preceding six volumes and the associated program data. Each of the First six volumes contains five sections in addition to the Introduction. Section 11 is the actual inventory of program data requirements reported by each agency as being used or produced in support of a program for which they have responsibility. They are listed alphabetically by title within a Department mission. Each entry in this inventory provides descriptive information about the data requirements as follows: data inventory number, title, the agency that reported use of the data, the time period to which the data relate, the accessibility of the data, the mode of processing currently being used, Program Data Requirement Identification number, subject index terms which describe the general subject matter contents of the requirement, an abstract describing the contents of the data included within the requirement, and the division, branch, or unit that can respond to questions concerning the data. 165 Automated- Systems Inventory: All major automated systems (operational and developmental) within, the Department of the Interior are described in the Automated Systems Inventory which is maintained by the Office of ADP Management. agency Resource and Land Unvestigations of USDI has produced several federal Wide data indexes and a 3. 000 page draft directory of existing data within USDI .This work was completed in 1975 and is being updated in, 1977. It will be available to the State Planning Office and other statewide users in Fall 1977. National Cartographic Information Center: This center maintains a data system containing information on the availability of aerial and space images, maps? charts, geodetic data and related digital data produced by federal agencies. The center also maintains information regarding the status of on-going Cartographic data collection efforts. A formal tie for continual updating of cente r activities is available through a quarterly newsletter which they publish. Earth Resources Observation. System Data Center: The EROS Data Center maintains, reproduces, and sells to the public digital and photographic data acquired by satellites and high altitude aircraft .. Input is received from LANDSAT I and 2 as second generation Film negatives, 16 mm microfilm of LANDSAT scenes and the NASA Skylab Earth Resources Experiment Package in various film formats. Also offered as standard products a.re 16 mm microfilm for th e 470 color composite scenes for the World Reference System 16 mm black and white microfilm of USGS photo indices, 16 mm microfilm of NASA aircraft missions in color and black and white, and 16 mm microfilm of the NASA Skylab missions in color and black and white. A number of Istates (i.e., Texas) have established direct computer ties with both. the Cartographic Information Center and the EROS Center. 106 4.2.3.1.2. Federal Agency Da to Bases and Data Systems This section briefly describes federal; data sources for spatial information and the system's the data bases are a part of. The important aspects of this information are not the data specifics presented but rather the agency programs, plans, and contacts which should all be regularly monitored to ensure continued availability of important listing data types and the timing on planned data collection and storage activities for planning future availability of data. Data systems of important are summarized, as well, for locational information and not for system specifications. Again, it is proposed that this data and activity monitoring can be best accomplished as part of a formal state data planning from ework. 107 DEPARTMENT OF AGRICULTURE Rural, Development Office This office is responsible for the development and administration of various farm oriented loan programs and for coordinating a nationwide rural development program utilizing the support services of a number of other federal agencies.The office holds no specific spatial data bases with coverage in Maine but it does operate ari information system of potential interest.The Federal Assistaace Programs Retrieval System provides data to local A community rural development programs,for how -communities may meet basic eligibility requirements All programs within the system are keyed to the Catalog of Domestic Assistance. DEPARTMENT OF AGRICULTURE Forest Service The Forest Service is primarily responsible for the managment of federal forest reserves. As.well however, the Service has responsibility for nationside leadership for forest managment and planning applications and research efforts.The service also, cooperates with state and local governments,agencies and organizations, forestindustri es, and private land owners in the protection, reforestation, managmen t, and utilization of millions of acres of forest lands and lands associated with vital watershed protection areas,, Important Data Bases and Information Systems are: I.An overwhelming majority of data bases held by the Forest Service pertain to the lands which they directly manage.Their single data base with complete national coverage in the National Forest Inventory.This in an extensive ten year incremental look at the forest related resource's of the country.For every state a large data base consisting of anumber .- of forest related parameters is collected,.stored, and analized. Most of the data are geographically referenced and spatially mapped. 108 The last survey was completed in 1,970.This information was collected and is now ar6ived without the use of ccrriputers. Plans are tenative to automate the 1980 survey using an accepted Forest Service data managment system which is currently under development. AIthough it actually holds little data of interest the Service has been in the forefront in the development of computerized spatial data handling capabilities. Presently because of the proliferation of such systems within the Forest Service(II major systems) the. Service is engaged in an effort aimed at taking the best aspects of this existing work and developing t wo standard systems for nationwide use.The two systems would include a grid mapping system and a polygon mappimg system. Because the final design of the systems is nowhere completed a number of their important existing systems are briefly outlined because of their particular rblevance to particular data handling problems. A.Total Resource Information System(TRI)- is a natural resource data base information system.. The system data base as it has been developed by the Forest Service is divided into a number of subsections covering data under the headings of recreational data, land cover data,seenic resource data,and aquatic and terrestrial resource data t-/pes.TRI is a basic grid cell system. It can perform statistical computations such as ccwss tabulation but it cannot graphically overlay two different data se ts for the same geographical area so from the graphics point of view its usefullness in resource olanning is limited. B.Polygon Layer Overlay Techn ique(PLPT)-is a polygon line system which stores coordinates for polygons and the identifiers which d :escribe them.The principle utility of the two PLOT system is in the overlay of at least A different mapped data sets creating a new polygon map from the composite of the maps overlaid. A number of statistical routines and mapping variations can be applied to both the source maps and the composifes.PLOT is also a useful system in that it can accept input at any scale and can process and extract data for overlay in teractively. 109 C.LIM,R3MAP.,and MIADS -are basic grid mapping systems.These systems are of utility if spped and ease of application are required and visual clarity of the maps, fidelity of area boundaiies.and the precise evaluation or' areas.may be sacrificed to varying degrees.All three systems are structured in that all only have line pri.nter output,not scale change for data inputting and few interactive capabilities. D. T he Geographic Locator System(GELO)-is a line segment system where each input polygon is broken down into line segments.fach point along a line segment must be entered into the data base with a right and light identifier code to identify polygons for analysis and retrieval. The data processing features of the system are:(I)store and plot back maps from any resource data type;(2)overlay one resource map an anoth er and produce composite maps; (3)join adjacent maps to create different data coverages for single analyses; (4)cal cu late and report for a number of statistical pciraimeters;and (5) produce.master. catalogs of all spatial data files in the system. Geographic referencing is by latitude and longitude and scale change For inputting and outputting data is also a basic capability of the system. 110 DEPARTMENT OF AGRICULTURE Soil Conservation Service The Soil Conservation Service is engaged in a number of activities and programs which involve collecting, storing, analyzing and distributing vast amounts of spatially referenced dato.Primary amont these are the National Cooperative Soil Survey,The Conservation Needs Inventory, Flood Hazard STudies, Inventories for Resource Planning, Land Inventory and Monitoring,The Storage and Retrieval. System for Soils Woodland and Range Data, Snow surveys and water supply forecosting,dnd their watershed and river basin planning programs. I.The National Cooperative Soil Surv@iey-4s generally composed of both general and detailed soil surveying activities which are carried out within state field offices. 2.The Conservation Needs lnventory7@provides soil,wate;and land use statistics for the nations land.The inventory was carried out in@ the late 60'd and' early 70's using a ra Indom statistical sample that ranged from 2% to 10% of, the total area of the various states.The system is automated and summaries and unaggregated data are available.The survey is currently being -updated (1977-1978) under the new title of "The National -Erosion and Sediment Survey." 3. Snow survey and -rainfa-11 data are Col lected and archived by state sources, 4.1owa State Statistical Laboratory Data System-contains soil names and descriptions for 95% of the soils of the United States.The SCS Cartographic Office in Hiattsville, Md. along with the Remote Sensing Task Force and the Resource and Managment Information System sTem Task Force are developing an Advanced Mapping @ which is uti I ized for automated mapping of soils information.As well,efforts are underway to arckive a digital soils data base utilizing the systems capabilities.As well interpretative capability mapping is part of the system.The system currently relies on manual digitizing and raster scanning for data inputting and a Gerber Plotter for output mapping.The system all- the moment is primarily being utilized in the Important Farmland Mapping Program, In terms of long ran ge deve.lopment plans SCS is planning +e development of a regional office accessib le data system which would be required to interface SCS internally generated data with the topographic, geologic,and socio-economic data of other federal agencies.It is projected that the system will be completely interactive in the regional offices.The system will also directly interface with a number of aerial imagery sources.Direct picture processing and image incorporation into the data base is a projected capability. Data banks of the most currentl LANDSAT imagery and the most current without cloud cover will be included and readily retrieveable.The purpose of this system is to permit SCS planning pe.rsonel in the Field offices to be able to model the landscape overlaying and compositing soils informaltion with land cover, land use and topographic data files. No specifics of the system are currently available as it is still in the early stages of planning and development. Department of Agriculture contacts are: Mr.R.Bacelius-Remote Sensing Task Force and Land Inventory and Moriltoring Program of the Soil Conservation Service Mr. Paul Holm-U.S.D.A. Office of Information Systems MrJames Lables-U.S.D.A Office of Information Systems MrJohn Kennedy-Office of Data Managment of the U.S.D.A. Forest Service 112 DEPARTMENT OF COMMERCE Bureau of Census GeographX Division The Geography Division of the Bureau prepares maps and materials for use by the other data col lection and analysis of the Bureau. For al I statistical census data types(i . e. , census of agriculture, economics, population.I'ect. I)geographic referencing Files are available and such files have been incorporated into both federal and.stateside geographic information systems. Spatially based census files currently held are: I)Master Enumeration District List-which summarizes the codes which identify the geographic areas presented in the 1970 census 616s. Counties,census tracts, and SMSA's are all coded for eventual cross referencing. 2)GBE/DlME Files which are street inventories of the urbanized portions of SMSA's relating address ranges to the various geographic codes. .3)Zip/Tract Cross Reference File is a correspondence File between zip codes and census tract codes. 4)DIMECO-contains latitude and longitude coordinates describing the boundaries of all counties in the United States. 5)ADMATCH-is a computerized address matching system for relating local area descriptive data files with the geographic referencing files such as GBE/DIME. 6)GRIDS-is a computer mapping program for displaying grid oriented datc.This is the primary program for graphically presenting ADMATCH analysis information and composite maps. 113 DEPARTMENT OF COMMERCE Economic Development Administration Business Research and Analysis Office This office in con6unction with the Water Resources Council and Regional River Basin Commissions has developed a water use forecasting information system.This system "The National Water Assessment Model". has four primary components. First are base use estimates for all industries per EDA district. Second, future water use practices and economic growth are modelled for each industrial type for each region Th i rd, forecasts are made using the previously derived estimates and finally forecasts are summarized to produce composite regional water use totals for the industrial sector which are analyzed and displayed both. in tabular and graphic formats, Important data files,include: I.Actual monitcrrLig@ of industrial water use for a national sample of 10,000 industrial plants nationwide. 2.Regional economic pattern summaries derived from other census information sou rces. 3.Technical hydrologic parameters such as recirculation rates, consumption and ground water base flow rates and surface storage capacities. All. data and sum mary reporting are disaggregated only to a subdivided majo r river basin. Modelling is done at this level but data is available for individual industrial plants. Geography Program As EDA's primary function is long range economic developmenf planning focusing on areas with severe unemployment a nd low income this program is designed to tie the Administration's socio-economic data to a spatial component for information storage and analysis. 114 Important data Files and systems are@: I)The Industrial Location and Retriey'al System-which holds compilations of community resource profiles derived from data sources generally originating with regional and state planning and economic development agencies.Also a industrial -manurfacturing census which is spatially references and retrievable,an industrial location modelling system and a summary of different state geocode systems are included. 2)Composite Mapping System-has the capability for merging numerous sets of digitized data (Grid and polygon but analysis is do ne only on grid data Files) with weighted combinations for composite socio-economic characteristic mapping.The system currently stores data at any scale in 9. 120,by 120 array of grid C'ells.As currently used by EDA each grid cell corresponds to about Four square miles which is suitable for statewide and multistate data analysis problems. Some standoM analysis outputs of the system are:zones of accessibility to linear or.point foci I ities(highways'or refineries for example), isopleth analysis of continuously variable data such as cl. imatic parameters and locational mapping depicting a range of socio-economic and land use characteristics. National Oceanic and Atmospheric Administration National Environmental Satellite SerAce This service provides satellite observations of the environment by establishing and operating a national satellite system and it conducts an integrated program of research and services related to oceans and inland@woters and the lower and upper atmosphere. Primary data files include: I)Varying nationwide satellite coverages stored digitally in recently developed mass storage systems. 115 2)Environmental modelling and composite computer mapping projects of which coverage is very localized and spotty depending on research priorities and cooperative agreements. Extensive satellite data for all states will be digitally archived and regularly updated 'I'Or public distribution. National Marine Fisheries Service This service conducts surveys of the living resources of the oceans,, analyzing in particular, economi aspecis f R ith an eye on improving ility. to use and conserve o isheries operations wi mans abi these [email protected] agency is also examining alternatives to ocean dumping and provides leadership in promoting wise and balanced managment of the coastal zone. Primary data Files are archived in the National Oceanographic Data Center w@,-Ich is summ,arixed at the endlof this section. National Ocean Survey The National Ocean Survey has responsibility for producing nautical and aeronautical charts. It conducts surveys and prepares charts of the harbors and -coastal and offshore waters of the United States. NOS currently has three major computer mapping projects underway or planned. [)development of a completely automated nautical system by 1980.System will include data acquisition, storageproduction and distribution. .2)Automation of the. pr oduction of aeronautical charts with a complete data system by 1980@ 3)Establishment of a digital data base for geodetic control and North American dat um by 1979. They are on a day i@o day basis producing digital maps of cloud cover, daily weather maps, aeronautical charts, and specialized mapping projects related to the continental shelf,defining state boundaries more accurrately and metric conversion studies in computer mapping. 116 National Weather Service This service within NOAA is resp@on'sible for reporting the weather of the United States and its posessions and provides weath er forecasts to the general public, issues warnings against destructive natural events such as hurricanes, tornadoes and floods. It provides special service is support of aviation , marine activities, agriculture, forestry and urban air quality control. Within Maine the service has appcDximately fifty reporting weather stations which ''are supplying data readily being utilized! by a number of state agencies as inputs into their planning and managment activities. Important data riles held by the service are: I)hnnual compendium of daily river stages. 2)Local and national climatic summaries. 3)Monthly storm damage data, 4)Crop-climate correllation summcrie s. 5)Annual statistics on climatic extremes. 6)Specialized data on various hydrologic and climatologic parameters with only local coverages. Much of the data distribution, storage and archiving by various NOAA divisions and services are through agency environmental data services which are summarized below. Cl" -natic Data Center-mai-11'.-lins nationala rchives on weather dai,:i collected since 1841.Allmattmiols are computer archives on microfilm and magnetic tape. National Oceanographic Data Centeel-maintains the computerized National Marine Data Inventory which stores the records of thousands of oceanographic cruises and projects. Data are stored so they can be retrieved by- geographic limits,the kind of platform involved,by country, state, county, dates, institute and subject.All material is reproduceable for users on tape,punched cards, microfilm, or printed hard copy. 117 National Geophysical and Solar-Terrestrial Data Center-deals with data related to the ionosphere, variations in the earth's magnetic fields,solar activity,cosmic rays, and marine geology and geophysics. Computerized indexes are held for such things as available photographs of the sea bottom by depth, location, and time, and punched card and tape records of undersea exp I orations. There is also a computerized annotated bibliography of undersea geologic sampling efforts.Tape stored records of seismographs from stations around the world are held with the magnitude and location of all earthquakes and follow up data indexed for reffieval.1n.addition, a complete geophysical data base for use in large scale land and ocean surveying and mapping is held. Age-icy Contacts general: 1. N OAA (general) -Robert Gelfeld who is in charge of the ENDEX indexing system. 2. NGDC-Daniel McGuire specific-, I.The National Water Assessment Model-John Klein ADP Operations 2.EDA Geography Program-David Portch-EDA Systems Division 3.The Environmental Dat,3 Centers-Thomas Austin 4. Geography Program of Census Bbreau-Larry Caubaugh-Data Users Service 118 DEPARTMENT OF DEFENSE Corps of Engineers The Corps is primarily responsible for major Federal water resource development activity which involve brigineering work such as construction of dams,reservoirs harbors, waterways and locks.This work is generally inten ded to provide flood protection fpr cities and major river valleys,reduce the co st of t5ansportati6n, supply water for industrial and municipal use, orovide recreational a= facilities for the natio n,regulate use of the rivers and orotect the shores of :the oceans and lakes.The Corps also provides planning assistance to states and other non-federal entities.Most data associated with these activities are collected and archived soecifically to an individual project orientation. Primary data files are: I.The Construction Engineering Research Laboratory Data System contains extensive data with nationwide coverage on rare and endangered aquatic and terrestrial species of insects,plants and animals. 2. The Fort Be Ivoir System for coastal wave dymanics analysis contains coverage on a national basis. 3. The nationwide flood plain map data file is a manual graphic and statistical data file. 4. Environmental Reconaissance Invento -is a manual system for storing and distributing r y 1:25 0 .000 scale moos on land and water resources Jorest types and wildlife habitats, historic sites, fanhdscape enhancement and reclamation potentials. Four states(not including Maine) have been done on a pilot basis and coverage for Maine is planned but has not been confirmed. 5.System of Information Retrieval and Anal sis-contains a national master rile with regional I y breakdowns on agricultural census data :'population and housing manurfacturing locations, commercial Patterns use of public recreation facilities,and environmental I ly sensitive wildlife oreas 110 This data is maintained centrally at the Laurance Berkeley Laboratory and is accessed remotely by regional Corps offices. 6.Hydrologic Engineering Center-contains automated procedures and data files for hydrologic engineering and planning applications, flood hydrograph dato,water surface profiles reservo,ir system analysis, streamf low simulation, rating and flood routing,basin rainfall and snowmelt computation and urban storm runoff.The center actually maintains little region or statewide data' but uses the system for specialized hydrologic analysis. 7.Water Control Managment System -is an,automated data system with nationwide coverage with an emphasis on hydrometric data and river basin modelling utilizing automated satellite i.nformationl( LAMLSAT).,, Agency contacts are: Robert Thompsom--Fort Belvoir C. P. Marks --Waltham Regional Office for New England 120 DEPARTMENT OF HOUSING AND URBAN DEVELOPMENT Division of Housing Managment Primary programs of H.U.D. are New Communities, Community Planning and Developmen.t.9 Housing Production, Housing Managment,and Federal Insurance, Primary data banks relate to open space residential and public faci I ities, water and sewer systems urban rehabilitation, Much of this data also and federal insurance statistics., resides in an aggregated automated form wilhin the Technical Services Division of the Maine State Planning Office H.U.D. does utilize a computerized information system with mapping capabilities for data bases related to subsidized housing' unit types and locations across the country.Data in this system desrcibe projects according to census tract latitude and longitude, number of units the each project,numbers of bedrooms, occupancy rates,and the specific types of program funding associated with the units.Refere nce for future planned and developing H.U.D. data systems can be found in HUD Long Range Plans For Data Automation 1975. Agency contact: Horace Bussell-Office of Managment Information 121 DEPARTMENT OF INTERIOR Fish and Wildlife Service The primar@ mission of the Fish and Wildlife Service,which is responsible for wild birds, mammals inland sport fisheries,and specific fisheries research activities is the assurance of maximum benefit from wildlife resources for the American people.Biologic monitoring, surveillance of pesticides, heavy metals,thermal pollution studies in fish and wildlife populations ecological studies, environmental impact assessment through river basin studies@s stream channelization, dredge and fill permits,and impact statement review, and area basin Olanning ofriverbasins and watersheds are the primary activities of the Service, Man of the data files held by the service are also held within the individual states..A number of files are associated with data systems in Washington. .1.1he Migratory Bird and Habitat Research Lan maintains a nationwide computerized data base on irilard waterfowl and game birds. Banding, recovery, and- survey data from general statewide and region specific studies are utilized to compute recivery rates which are utilized in the planning of a- 6umber of hunting and harvesting associated parameters. 2.Waterfowl Survey System involves a large numbe r of computer files and analysis routines developed to analyze quantitative information on, waterfowl population ecology.The syttem is utilized by a number of bureaus within the service in the preparation of environmental impaact statements. 3. National Wetlan.d Inventory-wi I I be undertaken in Maine 1978- 1979. The survey wi I I result in a digital and map data based on wetland types and locations. 4. Habitat Evaluation Information System is a projected automated data base an d system which will contain information on coastal aquatic and terrestrial animal populations and habitat characteristics. 122 5.A final significant data base is projected to result from a U.S.Fish and Wildlife Ecologic I Char-acterization of the Maine Coast@ (Biological Services Office).The relevant part of this study will include both graphic atlases!of mapped information and a digital data base. Data collection activity will be documenting data on coastal ecology, physical characteristics, pollbtant resideals, land use parameters and economic datc.This information will all be indixed and the infoin tion will be avallable.The collection of data is being varried on in three di rections. Federal, state, andregional sources are being contacted to collect data which has already been assembled within these sources to identify major data gaps.There will also bean Identify obscure data sources(i e. scientific research, thesises ect.). attempt to I Energy Resources Company. the primary contractor, utilized computerized analysis procedures in a majority of their work Availa6i lity of this data will Hepend on their data analysis strategy. They do project the availability of the following data types in digital formats: physiography, geology, groundwater, marsh habitats, upland habitats, land activity, soi Is, discharge sources, critical and unique areasJobster and clam areas endangered species,hazard areas, climatolo( Y, g land ownership and archeologic and historic sites.Digital format has bot yet been finalilzed. Primary contacts are: Warren Blandon- Migratory Bird and Habitat Research Lob Ron Beck-Energy Resources Company Ralph Andrews-Fish and Wildlife-Regional Boston Office 123 DEPARTMENT OF THE INTERIOR Geologic Survey Conservation Division Current responsibilities of the Conservation Division do not generally require the storage or distribution of extensive spatial data bases. It is projected,howeve@ that the division will be holding extensive amounts of spatial data generated by other divisions and departments because of a long term significant commi tment to expand mineral resource estimation programs and pro- duction monitoring activities. important data bases with national coveraged currently held within the division are: Well History Filesdarge sampling for all states) Ilipeline Data Base Field Reservoir. Reserve Estimation System(all significant mineral. resource reserves) I.-Id Information and Analysis Office This office exists as part of the directors office rather than as a separate division. The primary programs of the office are: Earth Resources Observations Systems;The Geography Program; Earth Resources App li cations; Resource and Land Investigations;and The Environmental Impact Analysis Program. The most important data base held within the off! is associated with the Geographic Information Retrieval and Analysis System. of the Geography Program.A nationwide data base for land cover,political units(county boundaries), watersheds, ce nsus subdivisions and federal land ownership is the basis for the system.Mapping scales are 1:100,000 and 1:250,000-All maps will be in polygon format.1983 is the projected completion date for the entire country and digital tapes will 'be available as areas,regions and states are completed. 124 Cover types and land use compilations are based on the classification scheme and definitions for level 2 land use and cover as outlined in Geologic Survey Professional Paper 964.Polifical boindaries will only include county and-state boundaries.The census county subdivisions map provides a graphic depiction of Census Tr Iacts in SMSA's and MCD's.The Hydrologic Unit map provides a geographic reference base for 'all statistical data held within the Water Resources Division of the Survey.Fecleral Ownership(all sources) and possibly State Ownership will be the P, final base map for the system. Cooperativ@ e federal-state agreements can be undertaken and the information will be then available after ho I 1/2years. Two other data collection and storage efforts are of potential interest to state data plannin( and information system efforts.The first isfthe "Image Data Inquiry System" which is held cooperatively with the NCIC and it provides access to indexing for over 6.5 million frames of aerial photography. Secondly is the "Cartographic on-line inferr active digitizing and display editing system" which is involved in research into problems associated with creating large digital data bases.The primary focus is currently working on the measurement of potential data volumes and the impact of these volumes on digitizing methodologies. The programs of this office are generally coordination oriented so there are few actual data bases held but rather the division is involved in developing protypic data bases and data handling systems. Geologic Division The activities described under the Geologic Division are primarily oriented toward research and clevel opmen t. Although many of the associated data bases are @extensive most of the divisions activities require or result in point geologic data.Most of the data bases are oriented toward solving rather specific problems -usually at the project level. However the division is working with the Bureau of !..-:7,,d Managment on methods to better reference the location of minor civil divisi ons in large scale computer oriented mapping.The division is also at work on a system winich 'Will allow the retrieval of geologic point data by political boundaries and deed descriptions. Primary data files of the division include: Radiometric Age Data Bank Paleomagnetic Data Ba:-,< Earthquake Information System Geothermal Data Bank Digital landslide suseptibility determination. Oil and Gas Files Geomagnetic Analysis Rock Analysis Storage System Computerized Resource Information Bank, Geologic Synopsis Program Computer Composite Mapping (detai I ed mapping done by special agreement ohly) Applications of Computer Cellular Mapping to Planning (research oriented Geographic System) Topographic Division The primary objective of this division is the provision of basic cartographic data for the United States as required by other federal agencies,state and local organizations and the general pu blic. The division is concerned chiefly vvith the production of any map seri es which requires national coverage or national mapping standards. Topographic and planimetric maps of the division include. I)the standard mapping series(non-digital)-1:24,000-,I: 62 PC',and 1:250,000 which by far have been the tradition focus of the program. 2)a variety of feature and color separates based on the above base information. 3)a new 1:100,000 intermediate scale which is currently only planimetric but is projected to be extended to include topographic information. 4)1:50, 000 and 1: 100, 000 county maps. The division.only on special agreemen t will provide other cartographic products derived from existing data bases-slope class mapping,aspect mapping and cultural feature mapping. The primary existing computerized data bases@of the Topographic Division are: I. Computer generated shaded rel 1 ef mapping 2.Aerial Photography Quadrangle File System 3. Geographic Names Information System 126 Perhaps most importantly is the recognition of need by this division for a national cartographic data base in digital form.This recognition is for both topographic and planimetric data types. The objectives of this proposed "National Cartographic Data Base " focus on the provision of appropriately coded clean data types for a number of different data types and parameters. The data currently projected to be inc@ luded are: I geographic referencing-coordinate -base maps 2.hydrography-streams and rivers, lakes and ponds,wetlands, reservoirs and shorelines 3.boundaries-political jurisdictions, national parks and forests, mi Ii fary lands and similiar entities. rectangular survey system-the public lands survey network 5. transportation systems-roads, railroads, trai Is, canals, pipelines, transmission lines, bridges,and funnels. 6.hypsography-con tours, slopes and elevations 7.geodetic con trols-monuments, markers, landmark structures 8.geographic names 9.all man made structures io.vegetation The plan is to have a prototype system; in operation by 1980 and the data base completed for the entire country within another 10 years(1990). data elements Thefidentiffed for inclusion in the system will be separated into two categories: I.digifal terrain data filas 2.planimetric digital cartographic files This separation is a function of data acquisition methods.The data base will use the 7 1/2 minute topographic quadrang!a,;eries of maps as the basic working unit for data acquisition, entry and storage. Preser-it plans are to use stere6graphic photogrommetric models for all areas unmapped at the time of potential data entry. one. The digital elevation data will be recorded as file per 1:24,000 scale quadrangle on magnetic tape.Within each elevation file,:data are to be arranged in sequential logical records. Three types of record formats are utilized.The first contains @pneral characteristic information about the digital model including quadrangle name, latitude and longitude of the lower left corner and UTJM coordinates.The second is a variable length file used for the actual elevations.The x and y coordinates are identified and the location of the remaining points can be computed -.)r estimated from pattern and spacing of points,rows and columns,and visual checking of the elevations against graphic source materials.All e@evations are in feet.The last record contains the number of elevations for the complete topographic map record. The planimetric Files will contain information derived by digitization. This will be done by manual digitizing,interractive line following and possibly raster scanning. Planimetric cartographic files are stored on magnetic tape and consist of two main sections in the proposed archival format. The first section contains [email protected] information for the file among which is a count of the number of overlays in the file.The second section is divided into a number of subsections, each su bsection for each independant network covering each cartographic, data type., The header section contains three logical records.The first includes the name of the map, local Y4 :mid y coordinates for the four corners of the quadrange sheet, map projection codes and the latitude and longitude for the southw est corner of the quadrangle sheet. The next subsection con- tains the number of networks, lines,nodes and areas. Every line segment on the map is coded with a starting node, ending node and left and right parc el identifiers.All coordinate values are to 001 inches.The third section consists of a number of parameters which are important for internal data base managment routines. The graphic section consists of node,;area and line elements.The node list is fixed in length and contains the mal'or and minor fields for the particular attributes being addressed.The second list contains positional coordinates which define the shape of the particular attributes.The third data list is reserved for descriptive texts. Water Resources Division The Water Resources Division is responsible for the Survey's appraisal of water resources and for research in hydrology related to -the occurrence distribution and quality of surface and ground water.lf is responsible for the coordination of water data collection activities of all federal agencies and for the design. and maintenance of data collection networks and for providing inform- 128 ation on the availability of water related data. This particular requirement is met by the previously discussed National Water Data Exchange (NAWD EX). Water data collected from hydrologic 'data stations,areal hydrologic information,and var'lous types of interpretive studies are stored in the divisions National Water Data Storage and Retrieval System'WATSTORE).In addition ithe division is performing digital processing of remote! sensing data for a number of purposes-predominately in a national wetlands mapping program, spatially mapping relationships between water quality data and land use change and in the assessment of various types of mining activity on water quality. Specific data files associated with WATSTORE include:streamflow data;reservoir contents;river stages,water temperatures; conduc ton ce; sediment con centrations;sedirien t discharges;annual maximum stream flows and river stage data. Agency contacts: i.Geography Program-Eric Anderson,Bill Mitchell, 2. RALI Program-Olaf Kays, Ethan Smith, 3.LUDA Program-Richard Witmer 4. National Cartographic Information Cent@r-John Swenmorton 5. EROS Data Center- Allen Watkins 129 DEPARTMEITT OF MERGY ENERGY RESEARCH AND DEVELOPME@,g ADMINISTWI10N) The mission of the Energy Research and Development Administration is to consolidate ev federal activities relating to research and d elopment of various sources of energy in order to increase efficiency and reliability in the use of all energy resources.As well on idealized long terms goal is to make the nation self succificent in energy and to advance the goals of restoring i protecting and enhancing environmental q uality and to insure public health and safety. 'Office of Environmental Information Systems This branch of. ERDA has the responsibility of initiating and coordinating research efforts into relationshia, s between energy planningand comprehensive environmental planning.The major thrust of the effort (conducted within the eight national research labs) is focused on developing environmental data bases and automated data handling capabilities for utility in compreshensive energy-environmental planning.The major systems and data bases of relevance are: -(Specific systems are not detailed but informatf on on them is being compiled by the State Planning Of fi ce) 1. Brookhaven National Laboratory--construction of an automated system for modelling relationships between land use and air,water and noise pollution.There are extensive data bases on air and water pollution for Maine much'.of which was initially gathered by in-state sources. 2. Laurance Berkeley Laboratory-contains no relevant data bases but is involved in extensive research and development in interactive comp)u&er graphics and grid based landscape analysis systems. 3.Pacific Northwest Labs-are involved in the development of a polygon mapping system. 4.Los Alamos Scientific Laboratory is involved in extensive studies incorporating demographic analysis procedures into spatially referenced information systems.As well,the direct processing of satellite do to is a focus of this labs research. Agency Contact: T. M. MillerJ office of Environmental information Systems 130 ENVIRONMENTAL PROTECTION AGENCY The EPA has programs in air and waste managment,water and hazardous materials, legal enforcement of environmental regulations and research and development. EPA coordinates an'd supports research and antipolfution a Ictivities by local governments and reinforces state and efforts within the federal government aimed at accounting for imact impact assessment in their various operations. It also makes public @its written comments on environmental impact statements. Most data bases held by EPA original' te from state by state reporting sources so most data types already reside instate. Information systems are listed because in some instances data may be accessed more rapidly from EPA than by the comp Iilation of instate data which is archived there. Data systems with national coverage are: I Standards and Regulations Information System II.Air Pollution Information System 2. Pesticide Emforcement Managment S stem 12. Energy Data System y 3.Discharge Compliance Data System 13.Solid Waste Information System 4.STORET 14. Environmental Assessment System 5. Pestidies Registration Wem 15. Noise File 6. Pesticides Analysis and Control System 16. Population Studies System 7.lnventory of Public Water Supplies 17.Models for Fresh Water Ecosystems 9.Storage and Retrieval of 18. Eutroph i cation Study System 9.Survey of Municipal Waste Water Treatment Needs IO.National Emissions Data System Agency contact:Offi;ce of Environmental Information Systems EPA. 131 4.2-322 Identification of State bata Sources The purposes of-this effort should be numerous. First it should present a baseline assessment of the data which are currently and projected to be available from state sources. This permits both an assessment of data availability from state sources but also function to measure met and unmet data needs when data available is examined.along with th e user needs assessment, It must be reiterated that data availability assessment is viewed as a continuing process and the information presented here is intended to function only as initial baseline information. Currently there are no complete indexes of natural resource information for the state of Maine. However, TRI GOM (The Research Institute of the Gulf of Maine) has been instrumental in the compilation of indexes for specific data types and in an effort currently in early planning stages to develop a complete interactive automated resource data index for the state. Previous efforts have included.the KWIC Index of socio-economic and environmental data and a Maine Rivers Bibliography. Based on information and. experience gained while preparing the New England Eridex, TRI GOM has proposed the development of a complete data indexing system. This system will allow any data user needing references to documents or files to go to a computer terminal where he can search a bibliography using specific key words. Modules will be developed separating the complete data base for simple, fastf and economical data searches. The modules will separate the data base by data type (i.e., map vs. tabular), data coverage @, -e.. marine resources), and geographic coding (statewide, local specific). It is projected that if implemented, the Maine Index will form a cornerstone in the data planning and geographical information system efforts. A second index of primarily spatial or map information will also be prepared for Maine during the summer and fall of 1977 by the Research Institute of the Gulf of Maine. Both hard copy and are interactive automated retrieval &, planned. This index is being designed according to a similar index recently completed for the State of Wisconsin. All spatial information with coverage in the 132 State of Maine will be documented according to the. following parameters: data type, dates of collection and publication, detail,. aggregation, quality, timeliness, sources, lengthy description and contacts for information and acquisition It is planned that once. the base is established it wi.11 be updated annually as part of a state data planning effort. Once completed, the GIS feasibility effort could utilize this information to deve lop reports of various data sets and how these data relate to probIlems in establishing a spatial digital data base. (For example, data documentation and evaluationi for specific data types.) 4. 2 .'4 Data Documentation As data availability assessment continues, based on data types identified by user needs assessment and/or from system design specifications, the characteristics of pertinent data should be fully explored to determine potential utility, to users within the system. Data residing in specific agencies being utilized only by those agencies with no other apparent users need not be evaluated as part of this process. A systems central available data base is intended for information and data types for which either a number of agencies have shown a need or for single agency which does not have the expertise or resources to analyze the data in house (L.U.D.A. Cover Mapping). An effort involving the construction of a matrix (Figure 4. 1, which is an example from the Texas Natural Resource Information System) which delineates data flows within Maine could be easily accomplished and would yield a concise picture of data use and sources for the state. State Planning Div:L sion Office Technical Servicesend TRI GOM data indexing makes such a matrix easy to assemble for review by the various user agencies. 4.2.4.1. Documenting Existing Non-Automated Spatial Data If the specifics of data collection mo de and resolution of data are acceptable for utility spatially referenced data may contain several types of errors. Prudent use of data indicates systeriiatic 13@ SUMMARY OF NRIS DATA/INFORMATION NEEDS USER AGENCIES AGENCY NRIS FILE NAME TOTAL RANK TWRC FTO7A TSWC TRRC TSOH TPWQ THD TWOB TIC GLO TES ACB BEG TWDB GENERAL LAND OFFIC E S"' ?:ta File 3 3 6 3 6 6 R,,tV'M,sLt:,';i 6 3 2 1 6 87 School Land Master File 5 5 61 2 6 5 85 1 Veteran's Land Master File 5 5 6 Veteran's Land T_t_at_i-s_tT_caT_ File 5 1 6 3 6 90 Unsold School Land File 3 1 6 1 89 Land Classification Master File 3 5 6. 5 85 _-MREST SERVICE 11. -rEXA FO TFS Woodland Management Activities 3 3 6 6 6 83 TFS Southern Fine -Beetle .0 2 3 83 oper t' TFS Wildfin: Reports, 4 3 1 5 89 W 111. TEXAS HIGHWAY TFS Fire Weather Re .ports 5 6 2 3 3 6 1 81 F9 DEPARTMENT I I Air Pollution Data 6 5 2 3 6 2 80 18 b b -7 �7- Structure InvenTory an(T Appraisal 6 6 2 5 89 -Road Life Data b 6 3 - 84 8-3 Automatic Iraffic Recorder 4 _T_ 6 Truck Weight Data - 3 6 90 Speed Survey -5,_ ___2_ 6 4 87 -Tr-19in-Destination Survey 5 __T_ __4__ 6-- 73 _3 6 80 18 Roadway charaZT-er-Rst-ics, State System 5 2 3 6 6 .1 85 Koadway Characteristics, County System 5 2 3 6 6 85 Traffic Log - 1 4 3 3 3 6 80 18 Kail-111ghway Grade Crossings Figure 4.1 Example Data Needs Summary for the Texas NRIS (source.: Texas NRIS Report 1974) data checking which should uncover errors if they do exist. Several errors and methods for documentation are presented below. A. Horizontal accuracy - because of time and economic constraints only relative determination of positional accuracy is generally feasible.@ For example, a simple visual comparison with a separate related data set at the same scale is a good test of positional accuracy. Another simple method involves duplicate compilations for limited sample areas using different methods or draftsmen. Another method is to relate cartographic representations to aerial photographic images to obtain checks by coordinate comparison. As well., it-is often valuable to coorelate information found on maps of differing scales. B. Map contents - should be checked because data items may be classified incorrectly or ground conditions may change between the time of data compilation and time of projected use. Generally, any information checking must depend on a sampling scheme. Random sampling and ordered sampling are two such techniques. I t is noted here that many map products which will be available have such determinations of content accuracy associated with them. Variance and standard derivation are used to quantify the validity of the data. C. Inaccuracies of base changes - is important because enlargement of map scale, perhaps from 1:250,000 to 1:100,000, enlarges line data and aggravates the displacement of any features. An acceptable map at a given smale may bedeficient at another scale. As well, when maps are reduced a di-fferent set of problems result. &cessive detail often clutters the new map; information may be repetitious and difficult to read; and,places where adjacent records meet may be disjointed. 4. 4. 2. Documentation for Automated Spatial Data Files Data may be acquired and recormati-ed into a number of formats and investigation is usually required to determine which format or mixtures of formats will provide the best possible data base. The task for computer oriented data is one of assessing the quality and quantity of information and 135 determining the sata transfer processes that will permit computer storage and manipulation of data. The data format, record formatf and volume of the data to be handled are the prime determinants of any data handling capability and generally have a great impact on computerized data handling techniques. If the data is not directly useable.and understandable at the point of acquisition some initial processing will be required. The various classification schemes used for computer storage must be examined. Verification includes checking the logic of particular systems to determine if the characteristics which have been encoded can be perceived on source documents and maps. In terms of recoding and formatting choices must be made among the various coding methods because the'potential classification of data manipulation routines and File structuring are strongly impacted by coding and format. 4.2 .5 Spatial Referencing A separate study is required of available spatial reference systems which can be either implemented as a standard for an information system o .r used with available data that requires locational identifiers. This task must examine and evaluate base maps, coordinate reference systems, and other methods of identification such as street address systems. This step is critical because there exists an unsolved problem of designing a spatial referencing system which can operate efficiently at both the regional and state leval and the urban development level. The problem must be addresses and it is likely that a resulting system will be multi0format capable of manipulating and relating data encloded in different formats and different scales. The interface between urban development and typical rural uses is generally of most interest to planners and it is accurately portraying this interface which is of great difficulty in spatial referencing systems. 136 4.1 'Data Avai I ability Summary ,*6. For Maine a preliminary review of federally available data, the status of current in-state data acquisition, and the characteristics associated with those data suggests that: A. Data relevant to the needs of users exists in a number of media, aerial photographs, maps, tabular summaries, and computer storage devices. B. Few efforts (i.e., the coastal p rogram) have begun to pull these data into a consistent format and to make the data availab le in a coordinated way. C. Data seri es (i . e. , soi Is) are av ilable in varying degrees of completeness and timeliness. D. Perceived data needs differ among agencies with even similar responsibilities. E. General standard analytical framework do not exist to guide collection of data for state resource planning programs. F. For federal data sourcest most series are currently incomplete for Maine or are available at smaller than desired.scales. The data formats and definitions of individual series (i.e., surficial geologic mapping) have not been related to needs for plann ing'and decision making being based instead on scientific and technical criteria. 137 SECTION V COMPUTER HARDWARE AND SOFTWARE AVAILABILITY 5. 1 Introduction......................139 5.2 Survey of Software ...................139 5.2.1 GIS Software issues...............141 5.3 Surve of Available Hardware.........155 5.4 Availability of Personnel ...........170 5.5 Some Hypothetical Alternatives...................................171 5.1 Purposes of a Maine GIS Facility (Mid-level)........................172 5.5.2 Hardware and "System" Software Requirements for a Maine GIS Facility (Mid-level)...............174 5.5.3 Applications Software for a Maine GIS Facility (Mid-level.........................................179 5.5.4 Resource Availability for a Maine GIS Facility (Mid-level) ...........o ...... 180 5.5.5 Operations of a Maine GIS Facility (Mid-level).......................................181 5.5.6 Estimates of Costs and Associated Parameters by Source of Computing Resource (Mid-level)...................................................182 138 Chanter V Assessment of Software, H@ardware, and Manpower Resources 5.1 Introduction Investigating existing software, hardware, and manpower resources which are currently available, or which will be available is required to develop a listing of resources for system development, operation, and maintenance. This step is not to select equipment or software or to determine personnel require- ments but rather what is or will be available, what it costs, and how various canponents may be obtained. This@assessment is vital for two basic reasons: 1) The equipment market is highly competitive ana vendors frequently make claims which are difficult to substantiate, and 2) Necessa:r7 funds must be available for the time when equipment or soft- ware will be required and personnel hired. Survey of Software When data manipulation, analysis, anu output requirements are defined the available software must be located and thoroughly documented. The software survey must also list hardware requirements for each software item. The que- stion of transferability must be explored in detail. Within this effort impor- tant software sources have been identified and information collection is and will continue to be underway. Some of these sources include: Federal Agencies USDA Forest Service USDI Geologic Survey aierggy Research and Development administration States Federation of Rocky 'Mountain States 139 1-11ew York Maryland 'Monlyana Hawaii Alabama Arkansas Nortn Uarolina The Uanadian Geographic Information System Research University of Iowa Simon Fraser University New York University at 3uf-falo University of Saskatchewan The National Labs -light Center of NASA Marshall Space I as well, the effort will Drocure and evaluate a number of software compi- lation and indexing efforts: 1) International Geographic Union's 800 page compendium of computer grapic software. (Hard copy available i-rinter 1978) 2) New York State Geologic Survey is.currently engaged in a comparative geographic information system study examining a number of statewide systems. (1978) 3) American Society of Planning Officials is evaluating the comparative advan tages and disadvantages of a number of statewide and regional systems for the RALI Program of the' Geologic Survey (1978) .140 G.I.S. So it ware Issues Software for Geographical In rormation Systems is currently in a state of flux as the emphasis changes from grid-cell-orientea system:3 to polygon-oriented systems. The greater flexibility of the polygon approach and the elimination of many scale-effect problems inherent in a grid system have provided the ariving force for this transition@. Due to the computational complexity of some operations involving polygons, these procedures are often approximated by grid- cell methods. As better polygon-briented algorithms are developed and as hard- ware advances (cheap parallel processors, array processors, networked micros, and software support for microcode facilities) are brought about., G.I.S. soft- ware will come to execute all primary operations directly on polygon-represent-, .ing data structures. Many fundamental issues in the analysis of spatial data remain to be answered by scholars. This is reflected in the paucity of analytic tools in all types of G.I.S. facilities. The meaning of spatially distributed data must be imputed from the G.I.1S.1s graphic ?z tabular outputs by human, not programmatic means. G.I.S. software will continue to evolve in the years aheaa as more complex hardware and more sophisticated analytic tools are brought; to bear on the more difficult phas es of computer-carto-graphics. Over the next couple of decades, pixel-oriented data from remote sensing systems will be in- tegrated into G.I.S. systems. 3reaktnroughs. in data storage technology and substantial advances in the mathematics of picture grammars will,allow an even- tual intepration of polygon and pixel (picture) systems. The' process of selecting potential Geographical Information System soft- ware shoulu continually be modified to take into consideration the State's changing needs for output (manipulation and analysis included) and the changing tableau of resources being offered by the several software sources. This on-going survey should assess not only the capabilities of ttle software offer- 141 ings, but also such issues as transportability, ease of implementation, and maintenance, quality of source code (correct It robust algorithms and lucid style), execution.efficiency (computational algorithms @z disk 1/0 mostly) & hardware and software requirements. Within this effort important software sources have been identified ana a continuing information collection process has been initiated. Most of the G.I.S. software systems monitored will be in the public domain and will beavailable at a nominal charge for copying the transmittal tape.ancl for documentation. In some cases technical assistance and training might have to be acquired also. That is likely to be billed at commercial rates where ,the activity does not fall under an existing inter-overnmental assistance program. Additional information may be collected on a continuing basis by scanning NTIS uOSMIC (NASA), publications together with GE-0 ABSTRACTS -a UOMPUTERI and L; ONTROL ABSTRA(;TS. The National.CartograpIl-Lic Information uenter is also likely to be of some assistance in monitoring G.I.S. software developments in Federal agencies. Detailed evaluation on a continuing 'oasis will require 350-1000 hours a year by personnel knowledgeable of GIS and the applicable programming languages. The lesser estimate would.be appropriate where management policy restricted the scope of the monitoring process by eliminating from consideration certain classes of G.I.S. systems such as those written in PL/I, those based on grid cells., or particular data structures, or those written for systems that would be difficult to convert for use on State hardware. Beside the basic survey of G.I.S. software several other monitoring efforts will be required. No G.I.S. system will provide all the analytic ca-pab-Llities required by line agencies for "production" tasks or by researchers for more erudite endeavors. Software compatible with the G.I.S. hardware must be moni- Uored in the following areas to augment the core G.I.S. capabilities: stati- 142 stics, @-eo-rap`r , graphics, modeling & data base management. The most likely candidate for a staistical package is SPSS (from SPSS, Inc.) because of its ease of usel its professional maintenance, and its avail- ability on many computer systems..@ Other packages such as BMu(P), OSI.KISZAS shouls be profiled also, but the'dearth of sound statistical packages in the small to medium systems market and the sales potential (several tens of thou- sands) should lead to the introduction of several significant new statistical packages in the next few years. DMIs offerings should also be noted because of the very high quality of the subroutine package and its potential (either as parts of user-written programs or unified by a data managing "ariver") for use in tandem with a G.I.S. facility. Auxiliary geography programs are available from several sources i.e., University of Kansas, University of 11ichigan, Bureau of the Census, and Harvard University. The programs available from these sources are usually special purpose packages that provide a single or only a few functions. The input and output of these sundry programs are not, in general, standardized but they pro.. vide a ready source of "raw material" that an adept programmer can meld into an existing G.I.S. facility at lo w cost. Another class of software that should be preiodically examined is conven- tional computer graphics. Packages range from basic systems to control graph- ics equipment such as plotters and image displays to sophisticated systems foi, interactively developing complicated graphic prsentations. Although the data structures normally used in computer graphics differ from those required by G.I.S. and cartographic applications,,the graphics software will be applicable (at least at a low level) in many:situations. Charts and graphs for reports, blocks of code for shortening the!development time for some G.I.S. modules, .and most aspects or a computer-assistea manual- cartography system can be deri- ved irom existing computer graphics packages. The three best -known systems 143 are GINO-F from the Computer Aided Design Center, DISSPLA from Integrated Software Systems Corp.., ana GCS from the U.S. Army Corps of Engineers. A detailed profile of Such systems found in 1977 ACM SIGGRAPH publications provides a good review of existing systems. The model-making software tools which will be required to simulate natural and human systems fall into four general catagories (1) discrete (SIMSCRIPT from CACI) (2) continuous (DYNAMO from Pugh-Roberts Associates for systems dynamics modeling and CSMP from IBM) (3)hybrid (GASP IV from Pritsker Associat rograms wri es) & (4) specialized subject-oriented p tten mostly in Fortran. SIMSCRIPT is currently undergoing conversion to sev eral mini systems, and is currently available on a number of mainframes. DYNAMO exists in a transportable Fortran-based version (although it is not the latest version), and GASPN is written in ANSI Fortran and will run on almost any system. The status of these basic systems and the avail- ability of other packages should be monitored in "Simulation" and the an- nUal ACM roster of programming languages and by vendor contact. Data base management systems should be profiled to determine if avail- able systems provide any.of the capabilities required to maintain the geographic-oriented data bases of a G.I.S. facility or to interface with socio-economic riles. Although any respectable G.I.S. is in part a data base management system, this capacity may not be conveniently available to perform such activities as editing or inquiry activities and it may not be adaptable to the maintenance of other types of data riles. Most DBMS's tend to be expensive, difficult to transfer from one type or system to another, and available on maxi (mainframe) systems or large minis. Both independents and computer manufacturers offer DBMS, but with the exception of possibly only MUMPS, none is available across a wide range of systems. Standards for DBMS exist but are only loosely applied in practice. Infor- 144 mation on this tonic is best obtained from, the hardware vendors of poten- tial systems, traae publications@(the sellers of this commodity advertize heavily), and organizations such as the 'IMMIS users group. The language in which the chosen G.I.S. is written and the language used for in-house programming will a@rfect many aspects of a Geographic Information System operation. At the moment, only one language, Fortran, shoula be considered for either role. The basic requirements for selecting a primary site language are: (1) ability to do the job, (2) few hardware selection restrictions, (3) appli- cation so_ftwar,-:@ (4) no major software transfer problems, 3 ease of writing, reading, rd aintaining programs ?z (6) availability of trainect programmers. Of the large number of languages currently in use only a few merit notice as the primary language in a G.I.S. context: Fortran, Assembler, PL/I, PASCAL, &PL, and Basic. Fortran meets all criteria reasonably well. It is a scientific lang- uage well adapted by design philosophy ana compiler optimization attempts to function well in a "number crunching" application like graphics. It's major.weakness is the lack of sop histicated data type capabilities as part ol the language itself. Data structures of any type may be created, but all the details must be seen to explicitly by the programmer. The language does not relieve him of any of these clerical tasks. Despite this drawback, most G.I.S. systems are based on Fortran. Almost all computer systems have Fortran compilers, so few hardware selection problems would result from the. choice of Fortran as a local standard. A vast amount of quality'soft- ware is available in @*ortran in the areas of graphics, cartography, stati- sties, numerical analysis, simulation, and other fielas relatea to a G.I.S. situation. Some transportability problems exist in spite of approved stand- 145 ards for the language, but it is the language of choice (of most prograrmiers) when transportability is the primary issue. Fortran code, the not inher- ently modular or structured, may be written in a comprehensible and main- tainable form. (Of the languages under consideration only PASCAL of."L"ers significant advantages in clarity.) A ready supply.of wortran programmers exists as most colleges use this language as the primary vehicle for instruc.- tion in scientific computing. A new Fortran standard is about to be introduced. A number of t-he language's weak points will be corrected, but a flexible data structure f facility will not be included. a2isting Fortran IV programs will run under the new @*ortran 77 (also called Fortrev) with few if any changes. Assembler is not an appropriate choice for a primary G.I.S. system language. It is hardware-dependent, is not transferable in general'& pos- sesses no store of applicable software. Assembler programs are difficult to maintain and-few programmers are skilled in its use. PL/I.merits some consideration as the primary language in a G.I.S. situation. It is designed .for scientific computing.(amongst other things) ana ofiers more data structure handling &apabilities as part of its syntax than does Fortran.. PL/I tends to be somewhat slower than Fortran and does not directly offer all the data structure capabilities required by a G.I.S. implementation. The language is available only on a small number of sys- lems altnouc@h that number is growing. Only a modest amount of G.I.S.- oriented applications software is available. The lanc-uage is reasonably easy to use if one staysaway from its arcane features and takes proper advantage of its.block structuring. It is somewhat difficult to rind good technically-oriented PL/I programmers. PASUAL merits future consideration as a basis for G.I.S. systems, and current consideration as a research tool.. Its structured programming orientation anu user-definable data structures make it a near-1,7 ideal @146 logical framework for a G.I.S. system. Because most implementations are interpretive or semi-interpretiv execution is slaw. versions utilizing fimviare "target machines" or compilers should relieve the speed problem. PASCAL is currently available on a small number oi machines but its rapid rise in popularity and the relati ive ease with which it is implemented T-rill likely change that situation. Li ttle application software is available in PASCAL and few practicing prog rammers are familiar with it. Experi- mental versions exist which are o riented towards parallel task execution. APL and BASIC have both been@used in graphics situations (often by the addition of graphic llprimati7@esll) but their usual implementati-on as interDreters: makes them too slow for the heavy computational loaa pre- sented by geo-graphics. while BASIC is nearly universally available, kPL is available on only a few systems. A siggnificant a;aount of applications programming exists in BASIC, but in general, it is not sophisticatect enough for a scien-rific/G.I.S. environment. Very little applications programming is available in kPL. Some transportability problems exist with both lang- uages. while BASIC source code (for medium and large programs) is not particularly easy to maintain analmodify, APL programs are distinctly worse. A gooa supply of BASIC programmers exists, but few programmers are conversant with APL. These two languages are best relegated to the posi- tion in which they %-rill best serve; as adjunct languages in a G.I.S. setting 4- arns that will be used once. they offer a very quick way to write short progrz Software transnortability is a major consideration in configuring a G.I.S. facility. It would not be financially possible (and certainly not reasonable to attempt to write all G.I.S. and ancilliary programs in house. Many quality puoliu domain and propriet ry packages are available 'or a ua fraction of their development- costs. To adequately make use of this enor- mous resource (anul in some small way to contribute to it) one must have 147 a c-rasp of some of 'he basic issues of software transportability in a G.I.S. context. Object code (prograns-essentially in the form that the computer executes) is not covered here since it is usually transporta@ble only between a limited number of identical or nearly identical hardware systems. Source code (programs in the form that is written and understood by humans) for many languages is much more easily tranfered from one type of system to another. Programs known as compilers or interpreters (peculiar to each ccm-o uter) translate the source code of a high-order language such as BASIC or FOhTRL.U into the "unique" form accepted by a given computerts processor. Given the constraints of a G.I.S. environment transportability needs to be considered only in terms. of a restricted number of languages. PASCAL is not widely used currently and little practical experience ex:ists in transferring large applications packages from one implementation, to another. PASCAL does have the advantage of formal ciefinition that is a clefacto stancara. The most frequently encountered implementation is a transDortable compiler which proauces code for-an imaginary "target" mach- ine. _kbout the only system dependent feature is the interpreter which then decodes these instructions in terms understood by 'he particular com- puter. This approach is less likely to produce compatibility problems than is the unique (machine-dependent) compiler approach. The rapid in- crease in popularity being experienced by PASCAL will likely generate de- velopment activity that should be,monitored as part of a G.I.S. planning effort. FORUM is the most easily transported of nearly all languages. This is in part due to the existence of a formal standard for the language (1966). 2eside this official standard, a kind of unofficial standard developed over the years as vendors added features (such as direct access, quoted hol- lerith strings, generalized subscripts, variable aimen.sion sizes,...) that 1.48 were commonly adopted by the industry. IBM G-level fortran is represent- ative of this practical standard., A new official FORTRAN standara (1977) is in the works. It is not yet lear what affect the new standard will C have on program transportability, but the problems are not likely to be severe. Difficulty in converting programs written in one vendor's FORTRAN to run on another system often results from subtle deviations from the standara or alternative interpretations of the imperfect or incomplete description of the context by the standard. @Details of implementation are not spelled out by the standard (and probabl@ shouldn't be) and this too leads to un- foreseen problems. Several different schemes exist for representing inte- gers and floating point numbers.@ when a program being converted makes use of the bit patterns in a way that the facilities of the 14'ORTRAN system would not, difficulties may arise. Computational situations, tho synta- tically incentical may result in@different answers on two systems because of different rounding, truncating, word length, and algorithm features. Differences in collating sequences can cause great problems with sorting and searching operations. "Do loop" parameters, subscripts, anu subpro- gram arguments are other constructs commonly subject to subtle differences in performance. Some computers are mostly register-oriented, some provide extensive -hardware support for stock operations, and others are essentially stack machines. These design differences give rise to variable capabilities for identical blocks of coue. For instance a mult-tasking rortran-written applications package developed on a st.@ick- machine and using recursive and reentrant features of the code will not run on a register-oriented mach- ine that does not allow subroutines to call themselves and does not permit shared code. Many other features of general system design, compiler imple- mentation, and operating system interaction may cause software transfer problems. 149 In practice the issue is co.n*sid.erably simpler. A program of "foreign" origin canfail. in only a few basic ways. If it does not compile the pro- blem is usually easy to spot and fix. If it.does not execute properly, a few runs with a good test data set will usually find the difficulty. The remaining small percentage of conversion malaises should be referred to a competent programmer. The programmer should identify the trouble- some,code with debugging techniques.and should rewrite portions of the program to isolate potential trouble spots. Rewriting stylistically poor code and inserting adequa:te internal documentation will eliminate many problems in this class. Another approach is to use the PPO.RT Verifier to isolate coue which violates the (severe) PFORT restrictions. Once re- written and tested against a sound test data set most problems will have been removed. writing-FORTRiLN programs for in-house useor for distribution requires stylistic dicipline on the part of the programming staff and sound docu- mentation practices. Programs should be written using only those features of the local FORTMIN com-oiler that meet tne standard. (The vendor can supply the National 3ureau of Standards test results.) Non-standard or necessarily-clumbsy code should be placed in seperate modules and appro- priately identified as should code which may bemathematically unstable (mostly problems revolving around finite arithmatic precision). Alternately the software may be written in PFORT or may be produced using the pre-pro- cessor.techniques developed by DISL. The latter is only practical in a large scale soztware development and distribution situation. The lifespan o! soitwareis deter-adned by a variety of factors inclu- ding subject-orientea advances. irariovations in haraware in the areas of Darallel nrocessing, content addressable memories, and networldna will generate future software regimes potentially much different from today's. 150 Many present systems will become obsolete because an inherently superior .methodology will be available. The same ware of technical innovention .that will obsolete some software systems and languages will also provide the means for easily maintaining old, but useful, software. Dnulation, compiler generators, and other software techniques will allow these pro- grams to be moved with some easeito a novel, non-native, environment. The computer industry is not likely to forget the problems of moving from se- cona generation to non-compatibl@ third genera tion systems. G.I.S. soft- ware is likely to be in a state of turmoil for trie forseeable future. 1.)etter, or even just appropriate, algorithms will be found for such tasks as polygon overlaying vector to raster conversion @c vice versa, manipula- tion of sparse matrices @e statistical analysis of spatially distributed data. The whole realm of picture processing is undergoing rapid mathema- tical development and it will have substantial future impact on Geographic; Information Systems. The near future is likely to see some limitect forms of paral I el processing become available at a reasonable price. Many G.I.S. functions are inherently parallel in nature (intervisibility analysis, -ridding, contouring, overlaying, some types of sorting & searching,---) @ana the current sequential approach to these problems will not endure for I very long (probably less ths@n 15'years at the outside). Illustrative is the task of overlaying polygons which has given the USGS so much trouble. "fultiple parallel processors could cheaply and quickly solve these pro- blems and the software to do the.task is conceptually simple. Parallelism in the form of a group of networked minis could even toaay offer a practi- cal, bruteforce solution. In general the future of G.I.S. software is one of change. The best that can be hopea for in an operations sense is that t-he transitions are smootn. 151 a new dicipline ana in Geo.-raphic Information Systems are in part part a novel integration oZ existing specialties. 'he care of the sub- ject is computer science (data structures.9 data management, and algorithm specification and implementation) along with sone basic cartography. Key personnel must be skilled not only in trie centurol GIS concepts but also in the peripheral conipone*-n-ts drawn.from mathematics and statistics. In practice the primary project inaividuals need to possess a basic undier- suanaing of the technical issues faced by the clients they will beserving in the natural systems and human services fields. The selection of per- sonnel is the most important phase of initialing a G.I.S. capability but it is the most difficult area in which to prescribe optimizing action. Top personnel must not only be good technicians but they must also have a very clear perception of where they ana their G.I.S. system stand in the scheme of all things. Tho this technology offers powerful means of organizing, manipulating and displaying data and of building models to give insights into the workings of the world, it also offers subtle and alluring pitfalls. Not only is "garbage in garbage out" a data processing truism, but, in a G.I.S. context of applying simple computer models to complex natural systems, "goodies in and garbage out" is perhaps a more appropriate aphorism. G.I.S. project management mustunderstand that thi@ir fancy system has the potential for being an expiditious means of doing very inappropriate things. They must also come anatorucally ecuiped to convey this message,to their superiors when requestea to "turn the crank". Selection of.Dersonnel offers the interesting situation of few being knowlegeable enough about (important) G.I.S-. subtilties to take the true measure of job.condidate s.' To aid the examination of personnel factors G.I.S.-related jobs are 152 lumped into three categories: director-programmer, programer operator,, and aata-entry specialist. (Three individuals coula potentially operate a G.I.S. facility during the early aevelopnent stages, but an operational system would probably require more.) The size of the G.I.S. staff is dependent on a host of factors not yet identified or fixed. 153 SICILL EEQUUM ITS A R REA OF EXPEPULITUE OR MTOWLEGE PERSO1.1NEL CATAGORIES 'D VATA F21TRY DMEE C TO, PROGRA2.11,1ER PROGRA1,51M OPER&TOR SPECIALIST PROJECT MAITAGEMM,T RDOIRED HE LPFUL GEOGRAPHIC DIFO SYSTEMS R-EQUIkED REQUIRED ?ORTRaN PROGRAIVII-M REQUIRED '2ZQUIREIJ SYSTE14S ANALYSIS RFJ@UIRED REQUIRED MIZE BKGD SOIE BKGD SYS TEEMS PROG-TWEMIG f=UIRED aEQUIREl) REQIUI@cm RB@UIRED E IF U-J-HOUSE IF rl-HOUSE HELPPUL uOMPUTER MRATI(IT S YS TE M SYS TEll DIGITIZER OPERATION R-rYJUIRED RIWIRE D =UIREu K=uHING MPFUL RBQUI-HED Soill@ BKGb PLAN1MG REWIREv HELPFUL HbLPFUL SOME BKGD u.ARTOGRAPHY @c GEOGRAPHY REQUIRED HEZP@UL 1-C- LPFUL UL SO(,IO-ZCQ401.IIU/CMTSUS JDATA ANALYSIS REQ=-,ED HELP@ SOM6 3KGD SOIXE BKGD STATISTICS RM@UIRZ- D 13-16 Q U I REE i) S 01-fE BXGD TL MOvELDM Ze"SUIULATION REQUIRED H-zLPit FORESTRY @c AGRICULTURE HELPKJL =,PTJL I TE L PF U L GEOLOGY & SOILS HELPI.UL HELP.-UL HELPFUL LD-9,10LOGY HELPPUL HELPFUL HELPFUL MaIRME & ESTUARINE SYSTEMS HELPMU FlELPFUL HELPFUL TTATU".",RAL SYSTEEM EUOLOGY HELPPUL HELPbUL HELPlUL S 01.1E. EKGD Ra-10r, S1,12"'ISDIG REQUIRED HM-P7vJL HELPFU 154 G.I.S. team meTr@bers can probably be round in Maine (especially if advanced notice of position openings is given.) The director and some of the Drogramming staff will probabl not be current State employees. If ly the "right" people were already oi 1 the payroll, this report would have been unnecessary. Some programming staff members and the data-entry spec- ialists should be drawn from current State service for their knowledge o.L' agency operation and working familiarity with the (conventional) methods that will be replaced by G.I.S. functions. User-agency personnel who will become involved with G.I.S. operations will probably not be selected for that role because of their G.I.S. exper- tise, but because of their need for G.I.S. services. This lack of supply oi'knowledgeable users must be dealt with by training and by the creation of specialized G.I.S. interface programs that "speak the language" of the particular diciplines. Non-State personnel may occasionally be required in a consultancy ca@ pacity. A small number of individuals associated with the University and witii private concerns are potential G.I.S. advisers. Remote sensing capa- bilities are available from the Universimy although little associated com- puter processing capabilities are,in place. Regional science, mathematical geography, and cartography skills are at best in short supply. Survey and cadastral specialists are available from the University. The supply of application area (wildlife, forestry, ecology,...) experts will be some- what diminished by problems of working in a new milieu, but those willing to adapt will be able to tap the substantial potential of G.I.S. systems. Survey of Available Hardware The equipment options -must be'identifiea ana documentation of specific 155 performance characterisitics obtained. The process should look for ranges and trends in cost and performance initially.anu- Only secondarily identify specific products and companies to monitor. 5.3 Survey of Available Hardware The monitoring of the.general purpose computer and computer graphic equipment market places is a necessary component of the planning process for the implementation or a G.I.S. This on-going process will be required to.g-auge the price trenas (for the most part downward), price-performance ratios (usually decreasing), and available units of function (some incre- ments may not be a convenient "size"), and to give early notice of ttle in-pact of new systems and emerging technologies. The pace of innovation in this field is truly astonishing. -A system chose either on the basis of cost or performance for G.I.S. purposes today would probably not be the s ystem selected a year orpossibly only six months from now. If G.I.S. equipment selection is not made with an eye to system "extensibility", the tiarclware will rapidly become (relatively) expensive and ineffectual in an environment of increasingly complex aria voluminous demands and of de- creasingly expensive newly-introauced equipment alternatives. The following subsections pro-vice a thumbnail sketch of general pur- pose computer gear ana computer graphics equipment potentially applica- ble'to a G.I.S. system. Machinery currently available or soon to enter the market is describea first, followed by potential developments of note. The heart of ainy currently conceivable non-research G.I.S. facility is a stored program general purpose computer. This appellation covers a wice range of systems from micro--computer systems costing a few thousand dollar s thru mini-computers costing on the order of a hunared thousand dollars to maxi-com-outers costing millions.. All of these t- es of compu- yp ters are potential candidates for utilization in a G.I.S. system. The 156 choice aeDends on the mix of tasks to be presented to tne sys tem. The most common maxi-computers are those manufactured by I3M, Honeywell, Bur- roughs, Univac, Control i)a'Ua, and Digital Equipment Corporation. These 9 machines are characterized by hi h performance, the ability to handle large problems, the availability of extensive systems and applications software, good.maintenance service, anu high price. The processors of these systems typically, perform ..5-15 million arithnatic operations per second on "words" of 32 to 60 bits. They have the ability to rapiuly access large amounts of data stored on magnetic disk and tape subsystems, to 'simultaneously' serve many users, and to respond to hardware and soft- ware errors in a so-Dhisticated manner. Over the next few years maxi-computers are likely to change slowly in price and capabi I ity.' IBIA, whose market position s1lows it'to strongly influence ths'profitability- of other computer companies innovative pra.- I ducts, has apparently adopted an evolutionary t ransition from its curren-c@ 370 series to its as-yet-unnamed successor line. Host other large system vendors will probably follow suit. The next 'Live years will undoubtedly bring performance/price increases of 100-200% based on cheaper main mem- cry and higher density conventional magnetic tape and disk secondary stor- age. It is not likely that a Imaxil system would be heavily used in any Maine State G.I.S. facility beyond the prototype stage. The amount of automated cartography and associated spatial analysis tasks is not great enough to warrent a dedicated mainframe ana substantial economies are to be found in smaller systems. The cost of computation on a =,i or micro system can be more than an order of magnitude cheaper. Such a smaller system offers a much simplified i'systems programming' and operational environment also. 1@7 Mini computers are, today, often mini only in price. They span a very wide range of caDabilities which now overlaps a substantial portion of the realm of maxi comDuters. They usually operate on words of 16 bits, although 12, 18, 24, and 32 bits are also sometimes used. These machines will Droduce .1 to 6 million operations per second. In recent years the variety and quality of systems software (operating systems language lilities) available for mini systems has come to rival processors, ana uw that available on maxi mainframes. Also, high performance peripheral equipment originally designed for maxi systems has been made available on minis. Micro-computers, designed to operate on words of 8, le, or 16 bits are now.finding their way into data processing and computational situations which ten years ago were the realm of maxi-computers and which more recent- ly were the baliwick of minis. They can perform up to 4 million operations/ see. Micro-comput-ers are-manulactured as chips.by the major semi-conductor iirms and sold as systems level products by them and a large number of other firms. Increasingly powerful and flexible systems are being marketed including micro-computer implementations of-existing mini instruction sets and bit-slice chip sets for flexible design of rather powerful systems. The peripherals usually available for general-purpose computer sys- tems include magnetic disks, nagnetic tapes, punch caras, and punch paper tapes for data storage and line printers ana character-printing terminals CD for.hara copy. A wide range of disk subsystems exists from floppy diskettes, thru cartridge disks, to high capacity 3330, Winchester, and Stoi-age Mod- ule devices. The floppy diskettes typically store .25 - 1. million char- acters of information (denoted 114bt), can randomly access any informatlon ofthe diskette in about 300 milliseconds, and transfer data at a rate of 30,000 to 100,000 characters/second. They are most often found on micro, mini, and intelligent terminals ana data entry devices. The cartridge 158 disks usually contain 2.5-20 Mb,have an access tine of approximately 50 data at a rate of several hundred thousand milliseconds, and transfer characters per second. Such systems are usually found on mini systems. The 3330, Winchester, and Storage Module type systems offer storage cap- acities of 40-300 Mb. access times of under 35 milliseconds, and transfer rates ex ceeding 800,000 cnaracters per second. These subsystems usually have sophisticated controllers for maximizing performance. These high- capacity disk subsystems are avai lable on most maxi, many mini, and even a few micro computer systems. Prices for a dual floppy system range from $1500 to $4500 with controller, cartidge disk systems run from $8.00 to $20,000 similarly configured, and the high-capacity systems go for $ 15,000 to $ 50,000 with a few actually under $7,000. Magnetic drums and fixed, head-per-track, disks acting as a slow, but. low-cost, extension of main memory are also available for many maxi and mini systems. They are usually special purpose subsystems with fast access anu transfer capabilities.[ Recently three other technologies have been adopted to this task. Change-coupled devices, bubble memory systems, and a form of magnetic care memory. The number of "disks" of any of the above types that can be attached to a given computer depends on the particulars of that computer. Typically for 'minis' four disks or disk subsystems is the limit. For 'maxis' the maximum amount of disk storage is usually much greater. Typical maximum disk storage capacities are .5-4.0 Mb for micros, 40-1200 Mb for minis, and several thousand No for Maxis. The costs of disk subsystems vary widely. A dual floppy system with controller can range from $1,000 to $4,000; a dual cartridge System with controller runs about $18,000; a dual Storage Module system with controller goes for $40,000 to $70,000; and a Winchester-technology dual fixed heaa system with controller costs about $10,000. 159 Magnetic tape systems are usually based on one of three media-standard 1/2 inch tape, 1/4 inch (3M) cartidges, or cassettes (similar to audio cassettes). The standard inch tape is the basis for most tape systems for maxi and mini systems. A' variety of recording standards exists with some systems working with seven tracks (200 bpi (characters per inch) & 556 bpi) and others with nine tracks (800 bpi, 1600 bpi, and 6250 bpi). The tape drives which read and write 1/2 inch magnetic-tape usually process only a single.. mode and density with nine track 1600 bpi being the most common capability. The 6250 bpi and multi-density drives are most commonly found on maxi sys- tems. The maximum storage using the standard 2400 ft. reels is about 40 Mb (at 1600 bpi) and 160 mb (at 6250 bpi). Transfer rates vary from 20,000 to 200,000 characters/seconu. The cost of a single arive with controller va- ries from $10,000 to $40,000 depending on speed, flexibility, special features (such as automatic tape loading), and controller sophistication. The 3M cartridges and cassettes are most commonly found on small-mini, and terminal equipment, although in some cases high-density 3M cartridges have been used on larger minis to replace 1/2 inch magnetic tape where trans- port ability was not required. Generally a 3M cartridge will contain .3-3Mb and the cassette .1-.3Mb with transfer rates around 50,000 characters/second for the 3M cartridge and characteristically much less for the cassette. Prices for dual drive systems with controller are approxi mately $3400 for the cartridge and $2000 for the cassette. Punch cards and paper tape facilities should not be considered for inclusion in a "modern" system unless the implementor requires them to in- terface existing equipment such as a keypunch pool's owned punches or lab instruments producing punched paper tape or if they are necessary for communication with "foreign" systems. Alpha-numeric terminals for data entry, inquiry, & time sharing, pro- 160 duced either as printing or URT (television-like) devices, are available from a variety of ventlors. These terminals mtLy be utilized proximate to the com-outer or remotely over teleDhone lines or other communications links at speeds of 11 to 960 charlacters/second. Many have elaborate edit- ing features, local storage., somd local processing capability., and ability to interface to a variety of diff erent systems. Most uRT terminals offer a display of 80 columns by 25 lines while the printing terminals reproduce the line printer format of 132 columns. A few of the "dot matrix" and "daisy wheel" types of machines p Irovide graphics facilities. Prices for quality CRT's start under -olOOO while good printing terminals begin at l_,)OO. Beside these "conventional" peripherals, there are other devices that should be noted, especially in the G.I.S. context. Speech recognition and voice response units are currently available, functional in cases where limited vocabularies are sui Iiicient, and potentially cost effective. A digitizing station utilizing these capabilities would probably be twice as productive as one without them@. A digitizer operator needs to carry on a complicated series of status,inquiry, status setting, and data edit- ing operations but he often is not free to use a keyboard device. A voice facility would allow these functions to be easily executed. Commercial grade voice input systems cost about 4512,000 but the existance of hobbyist system with nearly tne same capabilities and a price of @p 300 bodes well for the future. Voice response systems cost from $500 to $7,500 depend- ing on the technology employed, sound quality, size or vocabulary, and operat-iori@l Very sedate dial capabilities should be avail- able for 61,000 within a couple of years. Integrated computer graphic Systems marketea for automated cartograplh@7 are produced by Broomall, Benaix,'Calna, and ADpliCon. These systems tend 161 to be more oriented around com-outer assisted draftins than data base man- agement of graphic entities. In most cases the cata structures utilized by these integrated systems do not provide the flexibility required for gener al planning cartography nor'do they provide direct compatibility with likely GIS data structures. Due to the@low sales volume or such systems their prices are high relative to the value of soft,'@Tare ana hardware com- ponents. Prices range from $125,000 to 300,000. Integrated computer graphic systems marketed for general purpose graphic work offer better performance/price ratios than systems targeted for cartography.. Packaged general purpose systems and locally integrated graphic peripherals are currently the most auspicious approaches to de-,,rel- oping the graphic capabilities of a GIS implementation. Tektronix produces two systems of potential utility in a Maine system. The 4051 is a micro- processor (Hotorola 6800) based.graphics system employing a storage tube, cartridr-e tapes, stand-alone BASIC language capability, instumentation face and PS communications. It provides adequate capabilities for an intelligent-digitizing station and can also act as a gra phiQ termin- al. The Tektronix 4081 'is a stand-alone and terminal emulating system based on an Interdata,mini computer, large screen combination storage/re- fresh graphic tube, disk storage, and communications. It offers more capability than the uO51 but suffers from a lack of flexibil ity imposed by the.proprietary operating system. Both DILAC anu Megadata sell refresh- ed high resolution graphic systems.which avoid this problem. Each employs a mini computer running uncier the mini.vendor's standard operating system and supporting all standard peripheral dev--ces and standard software in- cluainc- Fortran. Uomtal corporation offers on LSI-11-based pixel-orient-ed color -oicture processing system that has sufficient resolution (102L@=24) to allow it to function in both a line draiiing and an image processing -415,000 to @?@50'000. role. The prices for the above systems rance from 162 The following paragraphs provide a cursory view of computer graphic input and output peripherals that nay be added to most graphics systems or may be used as components in a locally designed graphics facility. Computer graphic input peri@herals and data acquisition systems con- vert informati on contained on photos, charts, and maps and signals received from objects in the natural world into a form that can be processed by di- gital computers. In a GIS context the most common systems are tablet di- gitizers and remote sensing systems. Tablet digitizers are available in a variety of forms. (some are essentially modifiea draftin@, machines and should be avoided because of mecrLanical problems anu high inertia of the cursor assembly.) 11-10st di- gitizers touay consist of a flat@surface on which the object to be digi- tized is placed, a cross-hair cursor or stylus for selecting points and generatinR.status information, and an electronics assembly for control of degitizer operation and interface to a local computer, a local storage me- dium (cards, tape, floppies, cassettes2 or punch paper tape), or a com- It 4.0 munications line. The wor?dng areas range from 1111 "1 11 t 42 X @60 with accuracies, resolutions, ana repeatabilities from .01" to .005. Common options are multi-button cursors and menues for increased flexibil- ity in entering command and status information2 backlit boards for use with photographic negatives or for forced interpolation to a base map, P'C rear projection capability for extracting information from slides. Prices for tablet digitizers without interface or auxilliary recording media range from $3,000 to $12,000 depending on size, metric capabilities, and special- features. Among the vendors offering tablet digitizers are Talos, Science Accessories Corp,, GTGO, uEu, Summographics, Bendix, and Computer Talk. Optical line following aigitizers provide accurate, high-speed cap- ture of line data. They require@meticulously prepared graphics,as input 163 anct are only partially automatic, requiring human assistance for classif-L- cation of data and for rectifying pathological situations (perfidious nature, careless draftsmen, and stochastic dirt as still more inventive Man the programmers hired to create.automatic digitizing systems). These devices are ver-.r expensive. Raster scan devices for optically digitilzing source television cameras (especially those having capacitive discharge recording systems), scanning microdensitometers, ana Raster scan photo cell arrays. These systems render a map, chart, or photographic product as a two-dimen- sional array of intensity values. The raw output must be reorganized to represent a new array of pixels confornin to the metric of the database 9 or meaningfully aggregated into data structures representing line and areal entities. In practice this task is very difficult except for "pixel" oriented systems where input graphics have been manually produced in con- formance with the system's metric. -ht pens for use with Other devices used'for graphic data input are lig raster scan tubes ?z joy sticks, track balls, and thumb wheels which are options on both refresh ana storage tube systems for positioning of graphic cursors. Touch sensitive screens and function pane ls may also be used with both types of display devices to control graphic variables. Remote sensing systems such as UNDSAT spatially distributed environ- mental monitoring systems, ana even (soon to be deployed) point-of-sale and electronic funds transfer systems can be considered generalized gra- phic peripherals. Real-time spatially distributed computerized monitoring systems are likely to proliferate in the near future, blurring the tradi- tionaldistinction between the data collection process and the data analy- sis function. Computer graphic output peripherals provide aisplays of point or line 164 line data temporarily on the face of electronic instruments or perman- ently on paper,drafting media,, scribing material, and various types of "photographic" film. The two traditional systems, electro-mechanical plotters and cathode ray tubes, have been augmented in recent years by a large number of new technologies Two basic types of pen plotters, drum and flat-bed, are produced in a wide variety of sizes, capabilities, and prices. Drum plotters are usually found in 11" and 34" widths with the length of the plot limited by the length of the paper (up to 150'). The plotters' pen carrage may nave as many as four pens. The types of pens usually offered include pressurized ball point ana capillary action pens for use with paper, in- dia ink pens for writing on paper or drafting film (mylar, etc.), and, in some more expensive units, scribers. Prices range from $3,500 to .,more than $15,000 for drum plotters without interface. Flat-bed plotters are produced in small (11" X 17") timesharing units and large (34" X 34") high-accuracy devices. Ball point, fiber tip, anaindia ink pen systems are supplemented on large flat-bed plotters by scribers and photo plotting heads. The accuracies on both types of plotters exceed .0025" for mod- erately priced units. Plotting speeds run from 2"/sec. to 40"/sec., but for most cartographic applications speeds above 10"/sec. are not useful because of the high percentage of short rectors. (It takes a while to accelerate to maximum speed.) Prices for the small flat-bed plotters are around $4,000 and for the large models range from $15,000 to well over $50,000. Interfaces for plotters range from RS- 232, custom mini interface cards, and time-sharing devices for code compression and hardware graphic generation, to off-line controllers utilitzing magnetic tape to supply plotting information. Controller prices range from $1,500 for RS-232 and some mini interfaces to more than $15,000 for off-line units. Vendors of pen plotters include Houston Instruments, Calcomp, Broomall, Zeta, Gerber, 165 2c H. Dell il'oster (KE) amongst others.. .1@1;lectrostatic plotters utilizing a single color pattern of dots (with spacings of .01 to .005) are available in a variety of widths up to 72". Plotting speeds are around 2 inches/sec but vector to raster conversion prior to plotting may consume asubstantial amount of time. Prices range from @p8,000 to more than @@5,000 for the plotters and controllers are priced approximately the same as those for pen plotters. Electrostatic plotters are not as useful in small-scale cartographic operations as are pen plotters, but where fast, multiple-copy, monochrome plots are required or where frequent storage tube hard copies are required, they may be use- ful. The hic-h cost of speciall paper and supplies also restricts the cir- cumstances in which tney are cost-effective. Versatec, Gould, and Varian are the best-Imown electrostatic (printer-) plotter manufacturers. Inkjet plotters, available from Herz and Applicon,.provicle specta- cular color capabi lities on plots up to 2211 X 34" utilizing a variety of media. Areal plotting (coloring) capabilities are considerably superior toline drawing capabilities, but for thematic mapping it is a nearly ideal low-vol-wme device. Prices are approximately $50,000 with off-line controller. Display screen devices, in common use are storage tubes, CRTs, and gas discharge (plasma) panels. Storage tubes retain the image painted on the screen phosphers. This allows a complicated: pattern to be. built up over an extended anu permits the use of a low-speed interface to the computer. Changes to any. component of a disployed plot, however, require the entire screen to be replotted. Uonnion resolutions for these vector-oriented systems are 1024 X 1024 and 4096 -1. 4096 adaressable Doinus and screen sizes range up to 1911 diagonal. Uormunication with the proces- sormay occur at up to 9600 baud iflocal or transmission constraints ner- 166 mit. Prices run from $3,500 for minimal models to over '15,000. Hybrid devices having both storage tube and refresh characteristics exist as part of higher priced systems. The two primary vendors of storage tube devices are Princeton Electronic Products and Tektronix. CRT graphic display tubes are produced with vector and raster dis- ms more naturally conform to the opera- play methodologies. Vector syste tions of cartographic line drawing while raster scan devices provide a more appropriate mode (especially in color systems) for areal definition. Vector CRT resolutions vary from 512 x 512 to 4096 X 4096 while raster displays usually have approximately 1000 scan lines for high quality units and about 500 or 250 for less expensive units. Both vector and raster CRT's must have any image on their screens displayed at a rate of 30-60 times a second or the observer's eye will be irritated by the flicker. This refresh capability requires that the "memory" for the display not be the physical properties of the screen phosphors (as it is for storage tubes) but be RAM or CCD either within the refresh graphic terminal or a short distance away in a computer ana coupled to the terminal by a very high speed direct memory access scheme. Vector systems allow dynamic changes to be made to a line drawing which is distinct advantage in an editing situation but they also are limited in the amount of' detail that can be shown at a given time (due to the amount of memory available for refresh and the rate of which refresh must occur). Manufacturers of vector-oriented terminals include IMLAC, Tektronix, Hughes, Calconp, Vector General, Generall Turtle, and Megatek. Raster-oriented display are produced by Ramtek, Lexidat a, Aydin, Data Disk among others. Prices range from $3,000 to over $30,000 depending on features and options. Plasma display panels consist of an array of points of light that may individually be switched on and off. The screen does not require refresh- ing and thus may be used with low-speed serial interfaces to local or re- 167 mote computers. Panels are normally arranged in a 512 X 512 Point array with 60 points/inch in both dimensions. Plasma terminals are manufactured by Magnavox, Applications Group, Burroughs (alpha-numeric only currently), and several military systems vendors. Line printers and printing terminals may be used as graphic output devices in circumstances where they offer sufficient resolution. Pen plot- teremulation programs are available and many mapping packages exist with output designed for rapid line printer review as well as for high resolution slow pen plotting. Recently dot matrix terminals and line printers have come onto the market with impressive plotting capabilities. The only de- vice capable of meeting the volume demands of a GIS situation is Printronix's 300 line/min. printer-plotter. This device plots 60 dots/inch horizontally and 72 dots/inch vertically at a rate of 17 inches/min. on standard line printer stock. (One-twelvth inch squares may be employed where equal spa- cing is required.) The price is approximately $6,000. Other less common graphic output, equipment includes dry silver and electrostatic devices for making copies of storage tube images ($4,000- $10,000), film plotters such as those produced by Dicomed, & laser & elec- tron beam recorders manufactured by Gerber and several other firms. Sev- eral computer output microfilm systems have plotting capabilities in addi- tion to their hardware generated character sets. Typically resolutions of 16,000 x 16,000 or 32,000 X 32,000 points are available. Both raster and vector oriented COM systems may be used, but vector systems offer very substantial savings for cartographic applications. Since these machines cost $150,000 to $300,000, a GIS facility would contract for this service. A number of technologies offer future potential for GIS systems. No reliable projections can be made about performance characteristics, intro- duction date, or prices ana certainly the citations are incomplete. Any continuing GIS implementation study or active should maintain GIS group 168 an ongoing profile of emerging data capture, storage, processing,. @c display technologies. In terms of GIS systems the@most im-portant development likely to be introducea in the next few years!is the video disk. Experimental versions of both analog and digital systems have been produced and mass marketing of the analog system is imminent.@ The digital version should offer the same access and data transfer capabilities as conventional disks but are capable of extremely high storage@ciensities and exceedingly low costs (due to the economies of scale provided by the closely-related home entertain- ment systems). One to perhaps 25 1@billion bytes can potentially be encoded on a single (less than $10) medium and read or written using a device cost- ing substantially less than '05,000. Initial versions will probably be write-once, but substitution or optically-switchable recording media for holes burned in an aluminum substrate by a laser should eliminate that re- striction. In late 1977 a lower b'erformance photographic (central record- ing only) system was announced that was in many ways analogous to video disks. Large screen displays based on liquid crystal, light-emitting diode, 0 plasma, or other technologies may@become practical within the next decade. I Fiber optic transmission systems, some already being field tested, will soon offer very fast data communication capabilities that will make possible networking, database sharing, load spreading, and other activi- ties designed to maximize the performance of whole systems of computers and to provide-alternatives when individual processors or peripherals fail. Point-to-point digital satellite service is scheduled for 1981. This will provide @Hirtual site independence for components of networked systems. Potential transmission sneeds for these systems are several orders of ,169 magnitude greater than todays best data transmission facilities. The tech- nical issues in future data communications capabilities will likely be sub- serv-ient to political and legal issues. The boundaries between co=uting and telecommunications and between the 3ell -Systens and the specialized. carriers will occupy the time of Congress and the Courts for a long time to come. Ovonic graphic media will come onto the market in 19"8 with 3M's in- troduction of a dry micro-fiche system with the remarkable property of be- ing erased or written a single frame at a time. The potential of this type of material for computer cartographic output is substantial. This mater- ial also has potential utility as a digital or analog (holographic) means of data.storage. Optical computers have a remote potential for very fast operation whicn-may exhibit a high.degree of parallelism. Lack of sufficiently high-grade optical components is one of the major impediments to progress in this area. Very fast processors and memories may eventually,emerge frcm IBM's work on Joseph Junction dev ice s ana.from Burroughs? development.of ovonic circuitry. 1,1ost advances.in computer power in the short run are likely to result from the incremental increases in speed as circuit den- sity goes up and from various forms of parallel operation. 5.4. Availability of Personnel The capabilities of personnel must be weighed against nersonnel re- quirements needed to develop, maintain and use a geograpl-Lic information system. Good background on existing capabilities., including tne ability of po tential system users todeal with their problems in a quantitative manner will provide the basis for personnel planning. 170 Some Hypotnetical Alternatives If the State were faced with the immediate requirement of developing a Geographical Information System facility for itself and possibly also for the Regional Planning Commissions and the towns, it would not be pos- sible to evaluate all of the possible organizational settings for such a service nor would it be feasibi e to exhaustively scrutinize the current and future software and hardware@considerations. In such circumstances the course of action would have t o be determined on the basis of the per- sonal judgement of the State ana!consultant managers anu technicians sel- ected to carry out the mission as, opposed to the slow, detailed, analyti- cal approach espoused by the bull@ of this document. A GIS system derived in the former manner would not necessarily be an inferior product. indeed, a skilled manager operating on his 'instincts' will often proaluc@! a better- ommi integrated product than will a c Ittee, composed of diverse interests, whieft spends its time examining details. The following subsections des- cribe one such scenario and discuss the costs and other parameters asso- ciated with securing the required, GIS functions from different sources. The personal opinion of the,author is that there are two viable ap- proaches to a near-term commencement of GIS projects: a low-level approadi utilizing University or consultant services and equipment Zc a mic-level approach involving in-house or consultant systems. No high-level approach is suggested because of the costliness of such an operation, the rapid changes of performance/price ratios, and the lack of trained personnel. In the following subsections only the mid-level ap roach is ouulined. The n -P low-level approach would consist of a restricted subset of the @mid-level set of capabilities whicft, though probably not as cost effective on a per-unit basis, might well provide the least costly path to an eventual fulli-blown GI3 facility at a time when equipment was relatively cheaper 171 and the technology was more mature. The choice of tasks that woula con- stitute the low-level approach would depend on State priorities and the availability of particular graphics,. computatio n, soft-Ware, Rt personnel resources in the region. Uosts for the low-level approach would run from $24-5 OKper,year for personnel (1.5) FTEE's computer time, equipment rental, etc. for research to keep abreast of the field for the produc- tion of some of the more basic products cited in the mid-level profile. Additional requirements would result in corresponding higher costs. Se, veral very specific GIS tasks might be undertaken on a project basis for under @25K each, but pooling such jobs in a continuing series of related jobs,would probably be more fruitful. The choice depends on the parti- culars. Purposes of a 114aine GIS k-'acility (Mid-level) A mid-range State GIS facility would provide a "local" capability for storing and analyzing spatial data for meeting State needs and antici- t em pated Federal requirments for manipulating natitral sys@ s information. It would have the potential for becoming a cost effective replacement for or assistance to conventional cartographic operations. Given the drift of cartography towards an eventual digital constitution and, in particular, USGS's long-range plans, the implementation of a State GIS capability would commence the long processes of user training and infrastructure dev- eleopment that would eventually allow Maine to enjoy the benefits of this new technology. This State facility would have the ability to install the USGS's LUDA software and database for experimental and production purposes. It would also support the development of an in-house GIS or the modification of a "roreign" GIS to meet peculiar Maine needs. (Different data structures 172 ma7 be needed to optimaaly accom'date the local topology; different file 0 organizations may be required to,handle local inquiry anc utilitzation patterns.) An existing system would allow the State to make "live" eval- uations of software developed at,othe.- sites. This local evaluation is imoortant as many GIS operations,are "data dependent". (For instance, a system. develope d in the 1141id West!and tuned for the mostly "rectangular" land use geometry prevalent there may perform very poorly in a Maine con- text where irregular natural features are more likely to influence land use.) Production of.a number of useful planning and management tools would be possible with a computer-based' GIS. Maps and other graphics such as cartograms, graphs, and charts may be rapidly generated from the GIS data base. Similiarly, reports and statistics may be very quickly extracted fully implemented GIS. A well-designed facility from the data base by a I I can handle most tasks of this kind hundreds of times faster than a drafts- man or clerk and at substantially reduced unit cost. Uonventional cartography in'the near-411-erm can benefit from being pig- gy-backed onto a GIS effort in many cases. Chanae of scale elimination change of projection, rectification of maps of of systematic distortion, varying "metric" and many other activities may be under taken digitally when maDs have been digitized rather than, or in addition to being drafted. Another piggy-back benefit of a mid-level GIS systen is a general purpose con-putational ana data storage capability. in a planning context t this would most likely be utilized for maintaining and processing of socio- economic data bases, word prcessing (or an archieval interface to a distinct word processing system), running;selective mailing 11-ists, inde--dng documents, facilitating-: communication by routing information to agencies and citizens 73 according to catagories of interest, and innumerable other c-emeral lo--ical and computational tasks. A mid-level GT S capability would include graphic data entry capability via on-line manual digitizer for low volume capture of original grapl-de information and for correction and update of the various geographic data- bases. The capability would also be present for future expansion of manual digitizing.and for inclusion of automated digitizing equipment. Another major purpose of a mid-level GIS system is the support of re- search. The modeling of natural and.man-made systems is necessary to allo-w the mass of data potentially available to decision makers to be organized in a hope�@My meaningful (dynanic) logical structure. The presence of a State controlled GT Is would Drovide a common database for modeling efforts and potentially allow modeling efforts to be more useful. than heretofore has been the case. University, contractor, and agency simulation, whether theoretical or practical, coula all 'be consolidated in one place. (Con- tractor work. could actually be done on the State system to guarantee prop- er performance of the software on the "target" machine and to monitor progress.) Hany issues in theoretical cartography ana spatial analysis axe unanswered and a State GIS facility open to scholars would be benefi- cial to both parties. The research opportunities that would arise from any tag-along socio-economic database facility are likewi-se substantial. If the GIS's computer facilit-y were also used as a means for routinv technical (and other) information, a interesting opportunity would exist for*research in organizational behavior and program evaluation. 5.5.2 Hardware and "System" Software Requirements for a 7,1aine GIS Facility (mic-level) Any computer system considered for the heart of "Tie GIS facility should 174 be able to pass the following tests: 1) Substantial computational capacity at low unit cost. 2) Ability to run very large programsas logical unities. 3) Ability to serve severallsimultaneous-local and remote users. 4) Sufficient surplus capacity to allow a hiEh-level language approach to programming and a minimum o f systems programming. 5) Capability to run low priority monster jobs (such as polygon over- I I laying) in "background" at very low cost. 6) Unattended operation. .Hi.gh performance is a must in the areas of graphics and simulation. The computer should be optimized for floating point computation. A fast processor, the availability of fast cache and scratchpad memory and the presence of user accessable wribet Lble cord"rol store, 32 bit (vs 16 bit) AUJ and memory data paths, memory inter-leaving fast error-correcting main memory, and a high-speed floating point processor are all definite plusses. Several large =,i computers come very close to this profile. Another requirement is the ability to execute a large program as a single logical entity. Geographic applications often require very large arrays that often can't be conveniently @andled in parts and that won't fit in any resonable efiiciency in a mucn smaller amount of real memory. The disk re- quirement would probably be met initially by a 40-801,Ib drive, although sub- stantial expansion may be necessary in tine. Tape needs are for one 9-track 1600 bpi drive at first with the possible option of 800 bpi capability for compatibility with equipment at other sites. A 300 line per minute printer will sifffice, with very long print jobs being handlea at sites with hi@,n-. speed printers after transmittal on tape. Some 3COIPM printers also offer dot-matrix Dlotting -for quick review plots and Zor uRT hard copy. System software requirements'include a virtual-memoi-,r time-sharing 175 System software requirements incluce a virtual-memory tLme-sftiaring operating system, language processors, utility programs, and (bending the nomenclature slightly) basic graphic, statistical ?z modeling packages. M Ihe operating system should 'be a single integrated facility for control- ling all system hardware and software resources and for providing simulta- neous access to these capabilities for several users who may wish to com- pile, edit, execute or debug programs in a variety of languages. The OS's support of virtual memory should not place any 64K limits on codeor arrays-, and the supervisor proprogram should provide for dynamic allocation of system resources, device-independent input-output, and a tree-like direc- tory based named-file system. The operating system should privide pasS@- word-controlled capabilities and file access for security against inten- tional or accidental intrusion or file destruction. Jobs s1i"'UIC-1 to be spawned by terminal users for batch execution.. or tney should be able to be triggered by time or the Completion of other jobs. Unattended batch operation should be provided. User extensibility of OS features and control file capabilities are considered advantages. The operating system should also possess an integrated set of utili- ties for sorting merging, anu copying files. A convenient software facil- ity supporting the writeable control store hardware is necessary if the user is to be able to make use of this hardware. In addition, vendor commitment to industry-standa-rd networking protocols will possibly provide future oper- alional and maintenance advantages. Other system software requirements are for an Editor capable of handling large files.and working from scripts, and for a symbolic debugger usefull to non-syst-ems programmers.' Fortran is the language of choise for most GIS and related tasks all, -L Present. A11SI Fortran IV is adequate, but syntax equi,.ra'lent to 13111 G-level 176 Fortran is to be preferred. A macro assembler is needed for the coding of those few, frequently-used, routines whicq do not result in el-1--ficient code when written in a higher order language. PL/I, though not necessary, would be a definite olus if available b offering the potential for implementing such systems as Canada's CGIS. Likewise, PASCAL would be highly advanta- geous if it were available. It offers a nearly ideal means of dealing with .cartographic data structures, tho its usual semi-interpretive implementa- tion tends to run slowly. Statistical software needs can be met for the most part with SPSS, BIZOP, and D, fSL or SSP. Modeling efforts will require discrete, continuous, and mixed simulation capabilities which can be pro- vided by (Fortran-based) GASP IV* @ Data 3ase 11.1anagement Systems or 1-aTHPS (especially if it runs under theTegular operating system) -may offer signi- ficant advantages in maintainingisocio-economic and graphic data bases, generating reports from these data bases and providing a convenient inter- face between user programs and the system's data bases. The computer system need not be an in-house one as there are signi- ficant potential economies to be@aerived from a shared system or from con- tracting for specific resources. A cooperative venture with University agencies with similar interests a nd with the capability to meet the State's GIS-related research needs might be worth:7 of consideration. Another pos- sible means of savings would be t o contract for such a system from a private source during normal working hours, and have commercial work run on the system during off hours. This would potentially reduce the unit cost and would avoid the politics of replacing an owned (but obsolete) system in five years. An off-site computer JL7acllity could be supported from tem, inals over 4,300 baud voice-gr'ade phone lines or over hard-irired lines for an Augusta site in close proxi-laty. 177 The graphics subsystem for a mid-level GIS implementation would consist of irconventional" computer graphics peripherals meeting the particular and sometimes demanding requirements of computer cartography. The basic com- ponents of the graphic subsystem incluae display, plotter,land digitizer. fer The graphic display to.useful in a cartographic context should of.L a large viewing area (1911 diagonal is ideal), high resolution (4096 X 4096 points), the capacity to d-isplay a large number.of vectors (14000 minimum), and the-ability to rapidly display changes in data being edited. Refresh- ed graphics displays offer more advantages in this respect than do storage tube systems, but the latter may funct ion adequately if a very high speed interface is used. The plotter should be a drum or flat-bed "pen" plotter capable of plotting on paper or r,*rlar with a minimum dimension of 34". High speed is not required in a plotter used for GIS work because the nature of the data being plotted seldom allows this high-cost capability to be adequately exploited. The digitizer in the system should be a large bed (30'1TX4811 or 421U6011) back-lit unit operated on-line. ..In addition to the actual graphics hardware, a capability should be incluaeu to allow preparation of plot tapes for off-site C014 plotters, photo plotters, and inkjet plotters where their special features are re- quired or conventional printing is desired. The basic software r. equired to support the graphic subsystem includes a standard.graphics package (such as GD110-F, uISSPLA, or GUS) ana device handlers for the various hardware components of the system.. In addition, a contouring package should be procured and a series of graphic utilities s-liould be written-loca'Lly for editing, gridding, grid compositin1g, pol-yr:on generation from grids, and a number of other frequently used general func- tions. 178 5.5.3 Applications Software Requirements for a Maine GIS Facility (mid- level) The general considerations in the initial development of a GIS appli- cations software capability are to facilitate the productian of some useful products at an early date and to preserve flexibility by not getting too locked in to an approach or software package. The overview of the subject that is sketched in 1977 is not going to be very accurate in a decade and it will be a hardy creature indee if it is apt in five years. Adaptabil- ity and the opportunity to experiment with novel software approaches will be necessary until the technology slows its rate of change. The computer consultant personally believes that the most advantageous choice of Geographic Information System is the USGS's LUDA (GIRAS) System. GIRAS is not the best system currently available. In fact it appears that in late 1977 significant portions of the projected GIRAS facility are (1) not yet implemented, (2) not yet integrated into the system, or (3) not yet ready for dissemination. Still, unless the LUDA program has fatal flaws, or unless Maine needs to move immediately, GIRAS should be the pick over other currently better-developed systems. GIRAS will receive large amounts of money and effort in the next few years & Maine can expect at least some Federal support in implementing and maintaining GIRAS. This system will evolve over the next few years into a well functioning entity or it will be scrapped and replaced by one that does work. In either case the package will be "'supported" and will interface directly to the USGS developed data bases. If the State is able to attract quality personnel to its GIS effort and if it does not saddle them with administrative or production tasks i may be able to produce a superior home-grown system. This, however, is not likely to come to fruition. In-house GIS development a evaluation and in- plemenation of other GIS systems be undertaken, though. The "best" system may come from any one of these sources in actuality. Prior opine should 179 not carry more weight than the facts as they are revealled in time. The software for computer-assisted conventional cartography should be written in-house. and should make extensive use of the system graphics pack- age (GINO-F, GCS, This approach is potentially the quickest and neat- est. Most of the dog work has already been done. Statistical programs will most likely be written in SPSS or in Fortran with BMPD or SSP routines patched in. These programs would be written in-house to meet local require- ments. Similiarly, modeling software would undoubtedly be written by staff or consultants using GASP, Simscript, or Dynamo "tools" or possibly pure Fortran in an attempt to simulate Maine natural or human Systems. "Imported" simulation programs (as opposed to building block utilities or general frame- works) should be used only with the greatest care. Resource Availability for a Maine GIS Facility (mid-level) The equipment profiled in the preceeding subsections is readily avail- able from a variety of vendors. The computer systen (processor, memory, disk, tape, communication interface and printer interface) is currently available from Prime, Digital Equipment Corp., and Harris. Other vendors such as Interdata, Texas Instruments, SEL, National Semiconductor, and Data General are likely to introduce systems in this class in the next few years. Upgrades to Hewlett-Packards 3000 would allow it, also, to function in this role. Printronix offers the only 300 line per minute plain paper printer/. plotter, but Tally and Centronics may be expected to produce similar equip- ment. Electrostatic printer/p lotters are probably not cost-effective in this application although they do provide the dual functions. Graphics displays are available from Megatek, Tektronix, IMLAC and several other sources; appropriate plotters are offered by Calcomp, Houston instruments, zeta, and K&E amongst other vendors; and digitzers may be acquired from 180 .Summagraphics, Talos, and Altek. interfaces for integrating the graphics subsystem exist, in the form of industry standard RS-232C and I EEE-488 interfaces and custom interfaces for particular mini systems. System software is available from the computer system vendors or from software houses offering s"uch products as SPSS, IMSL, GINO-F, DISSPLA. Other items in this catagory,such as GCS, SSP, and BMDP are in the public domain. GIS appli- cation software is readily available 4rom the USGS and other public domain sources. The availability of good personnel is the most serious constraint on developing a wel I -functioning GIS capacity. it will be difficult to assemble a compatible, appropriately skilled, core team at a respectable price. The minimum staffing for �Maine State facility would be a d,irecLoor-programmer, a programmer-operator, and �digitizer-key-puncher with one of the first two individuals having some knowledge of cartography. An initial narrow focus of interests might allow a group of this -size to be productive, but volume work or 6 widened scope would require more personnel. 5.5.5 Operations of a Maine GIS Facility (mid-level) Beyond the initial period of hardware integration and implementation of a basic GIS capability the activities of a computer cartography group would be dependent on the then extant priorities of the Executive. Some of the likely candidates for early action are (1) developing a compute,r-assisted conventional cartography capability, (2) putting up the entire Maine LUDA data base, (3) supporting 306 projects with special data base development, modeling, and reporting, (4) providing Forestry with a means of spatially and tempo rally analyzing spruce budworm data, and (5) creating industrial siting and development evaluation capabilities. The computer 181 m The general consultant suggests that any GIS capability be initially 'aintained by the SPO. planning perspective would provide a more appropriate environment for the development of a GIS facility than would the more restricted milieu of DEP, Geology, LURC, or Forestry. The needs of. these other agencies should be initially met by a central facility under a coopera- tive agreement. The future may bring satellite systems, or even independent specialized units, but such alternatives should not be considered early in the development cycle. 5.5.6 Estimates of Costs and Associated Parameters by Source of Computing Resource (Mid- Level) The following tables provide subjective estimates of the major cost components of a GjS facility. No total is given as that could be misleading if accepted at face value. Sum- ming the major comp.onents would probably give reasonable approximations to true costs, but a m uch sounder set of figures could be derived by specifying the details left undefined in this hypothetical alternative. 182 COMPUTER SYSTEM QVID-LEVEL) PA RAM ETERS EQUIPMENT FIRST YEAR SUBSEQUENT POTENTIAl- ABI LI TY TO COMMITMENT FACILITY COM PUTI N G YEARS PROBLEM DEMAND PERIOD PROCUREMENT RESOURCE COMPUTING UPGRADE SYSTEM M ETHOD COSTS RESOURCE COSTS INCREMENTS RESOURCES C C ? CONTRACT 100K 100K YES NO S I N H LONG 5) BUY I qOK 25K 0 3 - 5 LEASE 50K 50K NO YES U S 3 -5 RENT 65K 65K E co U- M 3 -5 CONTRACT 100K 100K YES NO 0 S S 3 -5 CONTRACT 50K 50K NO POSSIBLE R I P R 0S I UT 3 -5 CONTRACT 100K 100K YES NO V TA A T 0E F P R S I T V IA 3 CONTRACT 40K 40K NO YES A NT T E E G 'AP ICS S -LEVEL) PAROETE RSYSTEM (MID COMMITMENT EQUIPMENT FIRST YEAR SUBSE UENTYEARS ____SYSTEM PERIOD FACILITY EQUIPMENT EQUIPMENT UPGRADE PROCUREMENT FACI LITY FACILITY MPTHOD rrAT rn';T C C LONG 5) PURCHASE $ 50K $ 7.5K BY NEW PURCHASE S 3 LEASE $ 25K $ 25K IN MOST CASES I N H LONG 5) PURCHASE $ 50K $-7.5K BY NEW PURCHASE. 0 3 LEASE $ 25K $ 25K IN MOST CASES U S E U LONG 5) PURCHASE $ 50K $ 7.5K BY NEW PURCHASE,, M 3 LEASE $ 25K $ 25K IN MOST CASES 0 S S LONG 5) PURCHASE $ 50K $7.5 BY NEW PURCHASE R 3 LEASE. $ 25K $ 25K IN MOST CASES P R 0 S I- U T V T A LONG 5) PURCHASE $ 50K $ 7.5K BY NEW PURCHASE A T LEASE I 25K $ 25K IN MOST CASES T p E E S I T LONG 5) PURCHASE $ 50K $7.5K BY NEW. PURCHASE V I A A N T 3 LEASE $ 20K $ 20K IN MOST CASES T E 3 SHARE $12K $12K E (Local site) SOFTWARE (MID-LEVEL) PARAMETERS (one-time costs) S APPLICA, ION SOFTWARE I.SYSTEM OFTWARE OS &COMPILERS GRAPHICS STATISTICS MODELING LUDA OTHER GIS C c $0 $0-15K $0-2K $ .5 -1 ol\ $0 NOMINAL, S AND UP I $0-15K $0-2K DEPENDING ON $0-15K POSS I BLE $.5-1 OK $0 NOMINAL, U VENDOR SPSS CONVERSION AND UP S NEEDED E -NONE-, - co LA $0 $0-15K. USAGE $0-1 OK $0 NOMINAL, 0 CHARGES AND UP FOR SPSS S NON E, S $0 $0.15K USAGE $0-1 OK $0 NOVINAL, R CHARGES AND UP I FOR SPSS p PROBABLY PROBABLY NONE, R 0 S I U T NONE, NONE, USAGE NOMINAL, V T A USAGE CHARGES USAGE CHARGES FOR $.5-1 OK $0 AND UP A T POSSIBLE CHARGES SPSS, POSSIBLY T 0 E E F LIK ELY OTHERS p R S I I T $0 $0 $0-2K $.5-1 OK $0 NOMINAL, A 1,dr) I IP V N A A T elm we m m m m m Eli PERSONNEL (MID-LEVEL). PARAN ETERS (Annual costs) IN-HOUSE SUPPLIER OF COM-PU RESOURCE PROJ. DIR.- PROGRAMMER DIGITIZER- OTHER MANAGEMENT SYSTEM-S RESEARCH PROGRAMMER- OPERATOR K.EYPUNCHER CARTOGRAPHER SYSTEMS C C $18-25K $13-18K $7.5-1 OK UNK:NOWN NONE UNKNOWN N/A S I N 0 $18-25K $13-1 8K $7.5-1 OK UNKNOWN N/A N/A N/A co 0@ U M, $18-25K $13-18K $7.5-1 OK UNKNOWN LOW $1 OK ON PROJECT 0 ES T BASIS S S $18-25K $13-18K $7. 5 -1 OK UNKNOWN LOW $10K ON PROJECT R EST BASIS I P R 0 S I U T V T A $18-25K $13-18K $7.5-1 OK UNKNOWN $15-25K.EST NEGOTIABLE A T T 0 E F P R S I I T $15K WITH THE POSSIBILITY OF V N A $18-25K $13-18K $7.5-1 OK UNKNOWN REPLACING SOME IN-HOUSE A T PERSONNEL REQUIREMENTS T E E SECTION VI DETMUNING SPECIFICATIONS FOR A GEOGRAPHIC ENFORMATION SYSTE4 6. 1 Introduction . ..... 6.1 . i Data Specifications ....... 6.1.2 ..... 188 Geographic Referencing ............. ........ ....... @6.i.3 Information Output Requirements ................. so.* ........ 189 6.2 Specification Decisions and Decision@Variables ................ 189 6.3 The Continuing Process 6.4 Specifications of Initial Findings ........ 194 6.5 Structure afid Evaluation of System A Iternatives ... ... oooo#194 187 Determining Specifications for a Geographic Information. System 6.1. Introduction The determination of system specifications occurs when the results of steps 1, 2, and 3 are deemed satisfactory to continue. Step 1 should set the ideal objectives for a system and steps 2 and 3 describe constraints or at least identify. where additional work is required to overcome one or more constraints. System specifications generally consist of three main parts: identification of data elements, data formats/ mediums for record archiving; a description of the spatial referen cing system to be used; and information output requirements. The work to be summarized here is only initial as it results from only the first iterations of steps onel two, and, three. Data Specifications Data specifications should consider balances between desired resolution, data availability,, and costs of data acquisition. Desired resolution is a function of the minimum area un. it that can be singularly recognized for use in the system, and the type and structure of the classification codes for the ather data attributes to be associated with each spatial unit. Data availability and cost factors, if need be, can be adjusted to determine specific data elements, data formats, recording medium, and record format. This establishes data input requirements for the system. 691*2. Geographic Referencing The next aspect of system specifications is a description of the spatial referencing system to be included. The concern here is with the structure of data for the entire geographic system and the need to have a number of referencing systems. Any spatially referenced data base has as i fs cornerstone a reliable planimetric base for geographic referencing. 188 6.1.3. Information Ou tput Requirements The information output requirements are concerned with the manner in which information can be derived from the system and presented to the user. This includes all necessary data manipulation and analyses required to transform stored data into useable information. All functions must be specified in such a way that it is very clear how each desired output is obtained. Attention to detail will prevent potential difficulties in development and use of the system. Test analyses should be conducted to determine feasibility and preliminary cost information. The three parts of system specifications must be evaluated in concert to determine the extent to which original objectives will be met. This allows the explicit determination of system boundaries. System boundaries refer to the definition of the system in terms of several variables such as purpose, usersi subjects included and total geographic' extent. In situations where a defined users group is not well recognized in the design processsystem boundaries result from interaction of the system sponsors and system designers (Figure 6. 1). the major system boundary variables are: 1) type of system refers to the dedication of the system (i.e., totally operational or partial ly demonstrative and research oriented). 2) geographic area 3) subject - the general categories of data considered appropriatefor the system such as land characteristics, economic, demographic, hydrologic, data, etc. 4) object - performance for the user includin 9 types of analyses and geographic displays. 6.2. Specification Decision: and Decision Variables S It is apparent that there are many ded isions, both technical and administrative, which are made during the design and development of a@-geographic information system. Decision variables are 189 3 Systar: Nects the System meets some C. System meets total Ztal total le--Js of of needs of ne-is of one user 'Mers a sinole user a sinale user and some needs of am scrie needs of Other users tai-Z jsr 7 3 3 2 3 E. System meets cormn F. SYstem meets common A System meets total 14. System peatS n-jS needs of two users needs of three users needs of all users tnat ire carm,-a to and needs of third t.-D Cr revv @Sers user that are common with first two users Total needs of sinole user Figure 6.1 System-Users Relationships (source IGU 1972) 190 a tool to examine the scope of these decisions in a relatively ordered way. Such decisions generally contain a number of options. As well, before identifying alternatives a procedure must be developed to reduce the number of possible alternatives' without rejecting any major alternatives. A recommended process is to group the variables on the basisi of interdependence and then evaluate each group separately (see Figure 6.2 and 6.3). Figure 6. 2. Example of Grouped Decision Variables Data Volume and Handling scale data storage medium file structures number of alternatives for each element software hardware automation Such a systematic grouping is strongly recommended because it forces explicit examination of each decision variable and if executed it 'assures no major alternatives will be overlooked. Failure to use such a process can easily result in an early decision, possibly regarding scale, that becomes overly binding on further system development. 6.3 The Continuing Process The final step of part 1 is a complete evaluation of data specifications in terms of effectiveness, consistency and feasibility. Effectiveness is @simply evaluated by a comparison of final data speci ficationsl to system objectives. A consistency evaluation checks that initial system specifications are internally consistent and that they represent a solution capable of meeting objectives. Consistency between 191 geographic referencing capabilities, data storage specifications, analysis specifications, and information delivery'is the desired end result. Feasibility assessment is a broad somewhat subjective evaluation of the reasonableness of the specifications particularly with respect to data availability constraints and overall cost estimates for data acquisition. To summarize stage I of the design process, the desired result is a set of consistent specifications representing system objectives. These specifications are viewed as being satisfactory for use in the generation of system alternatives. If specifications are unsatisfactory more iterations of stage 1, which this report has outlined a first iteration of, will be required. Although a system design should not be totally constrained by available data, systems will of necessity be constrained by the availability of geographic referencing systems. So system objectives and,specifi cations which have been developed will not.only represent an ideal system but rather reasonable objectives warranting further evaluation. Some specific products of stage I should be: - Data definitions - Geographic referencing system design - Specifications for data, retrieval, analysis and information delivery Cost estimates for data acquisition, and Subjective and explicit evaluations by a large segment of the potential user community of the work done to date@ 192 Class if icL-Llon Spatial syste.-n represent-akdon Data a c T-i I S L _@ i 011i 'ho me t d Scale' Aut o rn t i o n. Data storage medium File Software Equip@nent structure- Response time Data User I Centralization confidentiality access of system Authority to avners' InStitUtional of systern arrangement operate system H Strong relationship - -------- Weak relationsalp Figure 6.,", Grouping Decision Variables (source U.S. G.S. 1975 193 6.4 In summary, Phase I involves a long iterative design process which is probably best accomplished by an in-house user oriented process and not by the extensive involvement of outside system consultants. -A state dota. and information planning program is viewed as an ideal vehicle for this process as such an effort integrally involves do to gatherers, data users, and data user needs assessment. 605. Structure and Evaluation of System Alternatives When the products resulting from Phase I are deemed satisfactory and the data required to meet these objectives have been identified and documented in terms of data elements', data formats, record medium and record format! alternatives to accept, store, manipulate and output can be specified and evaluated. For data handling this investigation includes factors related to software, hardware, operating policies and legal implications for each al.ternatives. Evaluation of alternatives is done to determine the technical feasibilit-/' of the data handling procedures and their associated costs. The formulation of alternatives is accomplished by @alecting one decision variable value for each dimension of the system and if the resulting combination represents a feasible solution then the alternative is eval ated in greater detail. After feasible alternatives have been identified, each alternative must be detailed in terms of hardware, software and an institutional framework for system operation. There are likely to be more than one possible solution per alternative as hardware requirements can be satisfied by a number of different machine configurations. Relative advantages of different hardware, software, and institutional framework will depend on factors such as. time availability, information availability on the alternatives and the level of analysis which is underway. 194 The selection of the system to be implemented considers the feasibility and cost evaluations in terms of system specifications which have deRned data needs and associate d manipulation. requirements. The alternative that best satisfies specified needs at the lowest cost should then be ident Iified. If no alternatives are found to be satisfactory then either new alternatives must be constructed or system specifications may be odified. The decisions regarding change should depend upon the factors that contributed most toward, rejection of an alternative. 6.6. Summary In summary this general design process is based on a systems analysis approach (see Calkins, I 1972f 'W-1972, USDI 1975f 1977). It does assume each task should be completed in an explicit manner and that one task leads directly into the next. The system should be best utilized in an iterative manner with greater precision desired in each successive iteration. It must be noted, however, that due to extreme difficulty in obtaining good data for the various steps, it wil I often be necessary to continue with less certainty than this process with all its flow charts appears to offer. The only real advantage of such a process is that it forces greater attention on details where otherwise broad implicit assumptions would be made and it requires documentation of every step,, decision, and investigation, which has been the thrust of this woOk and should be the thrust of the continuing GIS e ffort. This report has presented a First atteni pt at determining system specifications. As outlined in Chapter 11, this report only covers phase 1 of what is viewed as a three phase developmental proce S. s Many of theinputs which would be required for true specification generation were not yet available so much of this work is left for later iterations of the various stages of the process i95 SECTION 7 FINIDINGS AND RECOMMENDATIONS This section will be prepared in cooperation with a subcommittee of the Land and Water Resources Council after the report has recieved a full review. 196 APPENDIX A GLOSSARY OF TECHNICAL TERMS ACCURACY The extent of freedom from error. That is, the closeness of a measurement to the true value, or to a value accepted as true. ACCURACY, OVERALL Accuracy over the entire range of an instrument. ALGORITHM A computer-oriented procudure for resolving a problem. AUTOMATED CARTOGRAPHIC SYSTEM Automated methods of producing maps and charts, in graphic and digital. form, in oruer to reduce production time. BACKGROUND PROGRAM A program being operated upon by a computer system at low priority usually during times when the CPU is not in demand for interactive operation. BLOCK The physical unit of information within a particular data set. BORDER Exact term for the division between, two mapped areas which is interior to the subset being bounded. Synonym: Boundary. BOUNDARY General term for the division between two mapped areas. Synonym: Border, Edge. *Taken partially, from Amidon, in I.G.U. Geographic Data Handling A-1 CADASTRAL SYSTEM System containing information on parcel boundaries by ownership of property. CARD IMAGE A representation in storage of the hole patterns of a punched card.The holes are represented by one binary digit and the spaces are represented by the other binary digit. CELL The smallest region in a grid. CHARACTER RECOGNITION The assimilation and analysis of pictorial information relative to numeric or alphanumeric characters. CHOROPLETH MAP Map showing discrete areas such as,states or counties. These units are consi- dered uniform with respect to the statistics collected within them. COMPILATION (A) Selection, assembly and graphic presentation of all relevant information required for the preparation of a map. Such information nay be derived from other sources. (B) A graphic document produced by this process. COMPILER computer program to translate a source language into an object language. or Operator of Photogrammetric machine who proauces a two-dimensional diagram from the stereo-photographs. or The cartographer who carries out the process of map compilation. A-2 CONFIGURATION The specific arrangement of system computer hardware and peripherals making up a system. CONTOUR Line joining points or equal vertical distance above or below a datum. CONTOUR PLOTTING The process of plotting (or drawing) contour lines from a data file consisting of a set of points representing a contour line. Synonym. Contour Drawing. CONTOURING AUTOMATIC The process of calculating ana plotting (or drawing contour lines with a computer and peripheral devices) using a set of points representing arbitrarily, selected positions on the topographic surface, and derined according to an arbitrary coordinate system. CORE The fastest access storage part of a digital computer usually utilizing magne- tic cores. CRT DISPLAY Cathode ray tube display. CURSOR Aiming device, such as a 1ens with crosshairs, on a digitizer. DATA BANK An information store usually in digital form organized in such a manner that retrieval and updating can be done on a selective basis and in an efficient manner. A - 3 DESCRIPTOR The catalogue descriptive part or digitally stored cartographic data or The keyword used in library information systems. DIGITAL Representation of data in the form of bits. Contrast with analog. DIGITAL IMAGE A two-dimensional matrix which represents an area on a photograph. Each position of the array is assigned a grey level, which may be limited to two. Synonym: Digitized Image, Digital Picture Function. DIGITATION, AUTOMATIC The process of convers ion of analogue or graphic data into digital form using automatic processors such as ALF character recognition pattern recognition, or scanning. DIGITIZATION, MANUAL The process of conversion or analogue or graphic data into digital form by an operator with or without, mechanical or computer aids. DIGITIZE To use numeric values to represent data DIGITIZER., GRAPHIC Machine that changes graphic cartographic information into a digital format for computer input. DIGITIZER, LINE_ FOLLOWING Device which automatically tracks an individual line and at selected intervals digitally record its -position with respect to an arbitrary coordinate system. Synonym: Automatic Line-Following, A.L.F. DIGITIZER, POINT A manually controlled cursor senses position, usually by electromechanical means. An operator must activate the recording of positional elements or other information. DISPIAY Any graphic presentation in hard-copy or as a transient image or device (usually CRT) attached to a computer for the rapid display of selec- table information in map or list form. DISPLAY, LINE,DRAWING A display system (usually CRT) which produces an image from lines drawn as a series of dots or vectors on a screen. DISPLAY, RASTER A display System (usually CRT) which produces an image from scan lines in a raster format. Usually used with refresh type displays. DISPLAY, REFRESH A, display system (usually CRT) which produces a transient image and which must be refreshed about 60 times/sec.,in order to prevent flicker. Comparable with a standard TV display. Limitied in data content because of time. DISPLAY, STCRAGE A display system (usually CRT) which produces a stored image at high speed on a screen and which will remain unless erased for a period of at least one hour. Not limited in data content except from aspect of visual clutter. DRAFTING UNIT A digiital computer-controlled x, Y mechanism capable of drawing lines to high accuracy. A-5 DRUM, STORAGE A rotating magnetic drum storage division for digital information used for relatively fast date access in computers. EDITING Detection and correction of the data obtained in graphic data reduction. ENCODER A device for converting linear or rotary mechanical motion into precise digital form. Widely used in digitizers and for position feedback for drafting units. FILE A variable number of records grouped together and treated as a main division of data. GEOCODING Geographic referencing or coding of location of data items. GEODETIC COORDINATES Latitude and longitude with reference to a standard spheroid. GEOGRAPHIC BASE FILE Coded network. GRID COORDINATES Euclidean coordinate system in which points are described by perpendicular distances from an arbitary origin. HARDWARE The physical components of a computer and its peripheral equipment. Contrasted with software. ISLAND Single-line boundary within a polygon. A-6 KEYBOARD A manual input device for function operation or alphanumeric entry. LIGHTHEAD Lightspot (and possibly symbol) generator - used by high accuracy drafting machines to record lines and symbols on photosensitive material. LIGHT PEN See Interactive Positioning Device or A device the size of a ball-point pen which is used for pointing to a location on a CRT screen. The coordinates of the location are obtained usually from a time lapse measurement within the raster image formation. LINE PRINTER A peripheral device for computers which prints a line at a time. At each posi- tion along the line a set of alphanumeric characters is available. Maximum line length varies by manufacturer, but, usually 120-130 columns are available on hi-h-speed printers. It can be used for high-speed listing or, by spacing symbols, as a plotting device. MAGNETIC TAPE, DIGITAL A method of storing data by selective polarization of the surface of a ferrous- coated tape. A large reel of magnetic tape (2400 Ft) will store a large amount of data but sequentially. MERGE Combine two sequenced files into a single sequenced file. or Joining two or more lines and areas together. When this operation is not simple a combination of deletion and interpolation may be required to present a plea- sing appearance. A-7 MINICOMPUTER A low cost computer with limited core capacity. Widely used for device and system control and data handling when large computations are not involved. OFF-LINE Processing is not directly under the control of the central processing unit. ON-LINE Processing is directly under the control of the central processing unit. OVERLAY Map of an area to be superimposed on one or more maps of the same area. The purpose is to find data combinations, or more exactly intersections and unions or Digital image of areas as in definition one above. OVERPRINTING Superposition by successive printing of line-drawn or continuous-tone informa- tion. PLOTTER An X-Y mechanism controlled by a computer generally for the recording of loca- tion information, e.g. symbols, names, etc. Line drawing may also be carried out but units capable of high accuracy line drawing usually are referred to as drafting units. Lines are drawn as a series of vectors. PLOTTER, DRUM A plotter where the sheet material is transported by a rotating drum which pro- vides the motion for one axis of the mechanism by a forward or backward motion. Pens are mounted on a bar parallel to the drum axis. The bar movement provides the second axis. A-11 PLOTTER, FLATBED A plotter where the sheet material is fixed on a flat table survace and the pen or printer is carried by a gantry and trolley mechanism providing two axes of motion. PLOTTER, LASER A plotter in which line vectors ire drawn by a laser bean deflected by galvan- ometers. POLYGON Plan figure consisting of three or more vertices (points) connected by line segments or sides. The plane region bounded by the sides of the polygon is the interior of the polygon. RANDOM ACCESS Process of obtaining information from or placing information into storage where the time required for such access is independent of the location of the information mostrecently obtained or placed in storage. RASTER, SCAN A line-by-line sweep across a display surface to generate or record an image. SCANNER Any device which systematically breaks up an image into picture elements (or pixels) and records some attribute of each picture element. SCANNER, DRUM Apparatus which scans and records 2 or moregrey levels, usually of reflected light from a picture fastened to a rotating drum. SCANNER, FLYING SPOT (CRT) Apparatus which scans and records grey levels, usually of transmitted light, by electronic means. A-12 SCANNER, LASER A scanner plotter with the exception that the light source is replaced by a laser to give very fine resolution. Total map production time is normally in- creased appreciably. SCANNER, PLOTTER A device which normally consists of a continuously rotating drum mechanism carrying and photosensitive material and a variable intensity light source which linearly traverses the length of the drum. SEGMENT Subset of consecutive polygon points. Synonym: Link, Arc. SEQUENTIAL OPERATION The performance of actions one after the other. SMOOTHING Filling a line of observed data points by a continuous line. SOFTWARE Programs used to control the operation of computers, originally this term included program compilers but was extended to programs in general and even to reports. SUBROUTINE A computer program which uses data or instructions from another program. A means of dividing a large program into smaller routines and which may be used without modification in diverse applications. UNIFORM GRID Square rectangular or, more rarely, hexagonal lattice for recording geographical data. The simpler grids are usually not related to geodetic coordinate systems. VEUTOIR A linear line segment, normally short, used to construct any line form on a plotter, drafting unit or display. WINDOWEIG A -method of designating and separating out a particular area of map data for presentation on a display. 14 APPENDIX B A REVIEW AND EVALUATION OF SELECTED CURRENT AND DEVELOPING STATE GEOGRAPHIC INFORMATION SYSTEMS I. The New York Land Use and Natural Resources Inventory (LUNR) II. The Maryland Automated Geographic Information System (MAGI) III. North Carolina Land Resources Information System (LRIS) IV. Montana Energy Resource Geographic Information System (ERGIS) V. Texas Natural Resources Information System (TNRIS) VI. Summary and Evaluation B-1 1. The New York Land Use and Natural Resources Inventory (LUNR) I General Description: The Land Use Natural Resources' Inventory is a computerized record of an aerial survey of New York's land resources. Supported by retrieval, analysis, and display computer programs it is the most well established st ate-wide land information system in the United States. The LUNR system contains 130 land use categories and four categories of supplemental data for the entire state. This data was manually encoded (grid overlay), then converted to automated records and stored as a sequence of descriptors for cells one kilometer by one kilometer. Two sets of computer programs are available for retrieval and analysis of stored data. The firstDATALIST provides tubular summaries of raw data and !;limited statistical analysis. Plan MAP IV, the second outputting programs, has sophisticated analysis capabilities and produces computer graphic maps. 2. LUNR History LUNR began from a formal pledge by Governor Rockefeller in 1966, to provide an inventory of the states natural resources. At this time the Center for Aerial Photographic Studies at Cornell University had just completed an inventory of the resources of the north- eastern section of the state using air photo interpretation. The data derived from this study had been coded and stored in a computer usin the 8ymAp program. Based on this experience 9 Cornell was contracted to develop LUNR. ! Prior to the development of LUNR, researchers at Cornell had gained considerable experience with gathering and analyzing land resource related information for large areas. Redevelopment of LUNR was reviewed as an opportunity to develop techniques applicable to similiar projects and problems regardless of particular geographic locations. However, final system operation was required to be independent of Cornell expertise and workable utiltzing commonly avail- able techniques and equipment. Its concept was not one of an ultimate geographic information B-2 system but one of operational applicability currently before more sophisticated techniques are available on a practical basis. As well as in New York, the system has been applied in Puerto Ricof El Salvador, and Colorado. -State Planning first required performance of a five county pilot study.. The pilot study purpose was to test classification systems and do initial computer programming. Initial plans called for the application of SYMAP to store and retrieve the collected data base. The pilot study showed this program to be incapable of effic iently processingIarge volumes of data, so P a new set of programs was devised. An initial land -use classification scheme of six basic land use grew to a list of 130 land use types after interviewing potential users from state agencies, planning organizations, and universities. The LUNR Land Use Inventory was conducted between 1968 and 1970 for a cost of $500,000. This broke down to a cost of $10. 00 per square mile. The federal government, dispersing Ap palachian development funds,, paid over 701 of.this cost. Once the inventory was completed a statewide users.service was established to serve the general public and local planning and governmental organizations. Two separate operations are involved, one at Cornell and the other at State Planning in Albany. 3. LUNR System Description 1 . Data Acquisition Compenent Aerial photography was the data source for of the data in the LUNR system. Pan- chromatic photography of a scale I in. = 2,000 feet was used to map the entire state. 17,000 9x9 prints were required to map the entire state. Those were combined into 129 I arger photo mosaics. About 151 of the, land use data required some t-ype of field checking. Supplemental data such as highway access were recorded Lin this way. Four other supplemental data classes were developed for incorporation into the LUNR B-3 system. This section consists of four@ general parameters which were geographically referenced to the 1 km Z grid system. The data are: 1 General soils map of the states which was derived from a map prepared by USDA. This map contained 214 soi Iseries. Resolution is limited to units of greater than 300 acres. A single L,UNR inventory cell occupies 247.1 acres. A single value - the dominate soi I mapping unit - was recorded for each cel I except where less dominant occurred often in an area but were masked by the dominant soi I of each cell. When this occurred a representative number of cells were also encoded with more than one soil type. 2. The general geologic map of New York which contained 152 bedrock categories was collapsed into a map containing only 11 functional mapping units. These units were interpreted as tol their basic characteristics, and capabilities. As with 'nt rock type for each cell is recorded. the soils maps, only domina 3. The third map, land form and bedrock depth, indicated land form surficial geology types and over burden depths for only a portion of the total statewide coverage (based on development pre I ure). Data values were recorded as percentages of ss each kilometer grid cell. The Final data type, Agricultural Economic Viability was prepared on the basis i of interpreted soil data, to ography, climate and water resources. Parameters P, I for locations markets, access and social cultural viability were also recorded. Three levels of viability were recorded and entered into the system. 3.2 Data Inputting As previously outlined, the system uses 1 km Z cel.ls related to USGS quads. This 2 system resolution is I km . This is recognized as overlay gross for urban planning but rural planning is the centrallfocus of the entire system. At this level 140,000 cells were required for statewide coverage. Each has an X and Y coordinate which is referenced to the southwest corner of the state. B-4 In terms of map preparation three overlays were required for land use. (1) Area over lay-polygons of uses were outlined and I area of each cell was estimated. I acre is the smallest unit of mapping for source data inputting. (2) Point overlay - consisted of small features which were noted as present or not and streams were valued by length per cel I. (3) Compilation overlay include all minor civil division and roadway' classes and lengths per ce I I . Coding sheets were used (one per cell) for each cell and land uses plus the supple- mental data types we re punched onto cards and then merged onto disc storage. Coding for all for geographic referencing was by coordinates, counties, minor civil divisions? and watersheds. LUNR data is stored on two direct access disc packs. One disc contains the land use data and the, over contains the four categories of supplemental data. There is storage presently available for 200 data items per cel I and this capacity can be ex- panded to thousands per cell. Data Retrieval, Analyzing and Output Two types of retrieval and analysis programs are developed specifically to provide easy do to access so unskilled users can retrieve desired data and perform sophisticated analyses without the assistance of trained programmers. DATALIST provides tabular summaries of raw data or analyzed data. Oitput is by cel I. Ten data categories can be examined per computer run. DATALIST is useful in statistical referencing providing an inexpensive means for matching cells and data characteristics. However, it cannot recognize patterns or identify adjacencies. Addition, substraction'. multiplication, division, and corss tabulation are the primary mathematical functions provided. PLANMAP programs produce line printer maps which show up to ten visual density displays produced by overprinting. Steps in PLANMAP analyses are study area delineation, assignment 13-5 of weights to data values, and excluding unnecessary data elements from analysis. Once established data are divided into final weighted values with up to ten different data levels. In addition some more specific optional analysis steps are: conditional weights (if then statements), fixed weighting ranges, scale factoring, decimal setting, and a number or' format commands for map and tabular display. 4. System Us The overlay DATAILIST and PLANMAP have all been used in the day to day activities of the State Planning Office in Ne York Regional agencies and communities have as W well used the system extensively. Some specific applications have been: Micropatterns; study which examined the impact of a range of proposed state facilities in central New York and then to develop recommendations concerning the role of the State Planning Office in facility siting decision making. The Appalachian Area Resource @Study utilized the system for providing statewide data for New York. .State Parks and. Recreation uses the system to conduct surveys of potential recreation a d openlands in conjunction with the pr eparation of statewide outdoor recreation n plans. Others have been reservoir siting, market studies, geologic feature identification, nuclear power plant siting and new towns siti[ng. 5. Future Plans Although early plans called for periodic updates it now seems likely that an entirely new data base and storage and retrieval system wi I I be developed. An interagency task force n investigating state spatial informatio needs recommended a system with greater resolution be tailored to fairly specific user nee Ids and be regularly updated. B-@ Field check Receive aerial aerial photographs photographic inter retation Sort photographs Aerial Draft Code data to Key punch Transfer according to USGS 'photographic LUNR Universal ---w coded data on 0 data to Transverse quadrangle maps interpretation overlays to EDP cards disk. pack Mercator grid Gather supplemental data. PRODUCT PRO UCT Data sLdtable for LUNR overlays of com uter USGS 7.5-minute . p quadrangles manipulation and display Figure 1. LUNR. Procedure Summary al Inter retation, r aph t ov -:e atjon@ fun Aw low aw I!.: The Maryland Automated Geograp hic Information System (MAG 1) General Description_ The MAGI System'@tores geographic data in a consistent and coordinated manner similiar to integrated base mapping systems. "The !data bank and associated data handling system developed are utilized to generate computer maps displaying the capability and suitability of the land for various uses across the geogri phic context of the entire state of Maryland. The input utilized to generate these composite maps are numerically wei ghted map variables collected and stored in a geographically @referenced data base.The computer map outputs are used in a variety of ways which generall relate to the development of a Generalized State Land Use Plan.Maryland planners are cu ently utilizing the system in the interpretation of photographic @nd satellite data from tKI Z i1. For the state the objectives of I MAG I@ go beyond initial resource inventory, data base managment,and analysis of landscape potentials.The system is hoped to lead to,the establishment of a basic structure for continued state wide integration and analysis of numerous types of geographically based statistics(i.e., census and socio-economic data as well). It is recognized that the system will never replace all forms of m anual data handling but will continue to operate as a tobi for increasing analysis capabilities and decreasing the costs of statewide data storage and retrieval. MAGI History The system was designed and implemented by the Environmental Systems Research 1-nstitute of Redlands, California under the joint sponsorship of 11-he State of Maryland and the National Aeronautics and Space Adminis Itration.Work was begun in 1971 and the system was operational in 1974. B-8 Impetus for the development of a geographic information system was legislation creating the Maryland Department of State Planning and assigning to it the responsibility for preparing and continually updating "plans for the development of the state,which plan or plans collectively shall be known as the State Development, Plan. Realizing the need for the systematic development of such plans and the requirements for extremely large scale data storage and analysis operations the Department of State Planning initiated six general efforts leading to a state wide planning information system. These efforts are as follows:Othe selection of the significant geographic indicators which influence intelligent decision making for land use planning;2)the collection of existing base maps which described Important geographic differences in these indicators;3)the design of automated computer filing systems for this geographic data; 4)the,desi gn and development of comput er programs and procedures for analysis, retfieval and updating of computer encoded data types;5)the development of computer models for the analysis of data and Othe development of in house capabilities for operating the system;specifically training state personnel in complete system use. MAGI System Description 1. Data Acquisition Component At the time of the project's initiation the state had already maintained a substantial amount of high quality data for state level planning.Th is data was checked ror its validity, level of applicability, timeliness, compre hensiveness and flexibility in potential use. Each source agen cy was asked to identify the idiosyncrasies associated with their data and present a complete analysis of their perceived data limitations and usefullness. Data was collected from many sources. Eachof these sources maintained data of varying quality and age,with incompatible scales on different base maps.Because of budget constraints B-9 this data was not always remapped on uniform base maps but each series was referenced tothe State Coordinate Grid System and reformatting occurred subsequently during the digitizing and data processing stages of the system. A total of Fifteen landscape variables were selected and gathered for the entire state.They were selected according to their applicabillity to three categories:capability onalysis;suitability analysis;and special studies analysis. Utilizing the definition that capability, analysis is the "ability of a resource to support various activities,and various levels of activities because of inherant physical activities, " soils, geology, aquifers and aquifer recharge areas, mineral resources, topography,natural features, vegetation and surface hydrology were selected as primary capability variables. Suitability is defined by Maryland planners as an "entity, usually manmade, induced or influenced that allows one to indicate preference as to a resource supporting various levels of activity given the czaabi lity of alternative sites or areas. Pubi ical ly owned property, sewer and water facilities, transportation systems, historic sites and existing land use were identified as primary suitability variables to be included into the system. In addition to the thirteen capability and suitability variables several other data items were collected and digitized for special purposes. Watershed boundaries were included to be used in the mapping and analysis of stattstical data' generated for surface water in NWpicind by U.S.G.S. and EPA. Electoral districts were included because census tract data breakdowns are nested by electoral boundaries and state tax and land use records are also maintained at this level of spatial c4gregation. The fifteen data variables came from @ er thirty data sources and were compressed into thirteen OV sets or' maps covering the entire state. Mine! ral resources and aquifers and aquifer recharge areas were mapped together as were electoral districts and watersheb boundaries. B-@i 0 Al I base maps were atone of three scales-1:62,500,1;63,630, and 1:126,720.With the exception of land use all xx:pc variables were mapped and eventually digitized on a county basis.As statewide analysis required some type of coordinate referencing system for the eventual merging of separate county data files a localized version of the State Plane Coordinate System which utilizes Lam6ert Conformal Conic Projection or Universal TransverseMercator was adopted. Each of the thirteen variables on the county maps were referenced with this system. 2. Data Inputting Two systems for digital encoding were employed-manual encoding and automatic digitizing. For the manual coding a grid was drawn at the appropriate scale for the different base maps but would correspond to 10,000 ft. planar lines was drafted and overlayed for referencing.A smaller g6d at a scale of 2,000 ft. per encodingi3ell was drawn on the overlay.The rows and columns for each cel I was then numbered. For the data types mapped as polygons, the, recording for each cel I was determined by the polygon.which occupied most of the area of the cell.Land use included encoding the secondary and tertiary polygons and soi Is the secondary polygon. Data mapped as lines were recorded by determining the most important qualities occupying a given cell.For example, if two roads of varying size Filled the some cell then the larger road code was assigned. Map data expressed as,pointsCi e.,natural and critical areas) was encoded by row and column of the cell and number coded for individual reffieval. All variables except for watersheds and electoral districts were encodeId by manual techniques. These maps were digitized as polygon source maps. Once digitized a, special ppogram was utilized to convert this polygon data to grid data to maintain compatibility with the riles which were manually encoded. Because of relatively large polygon size for these data variables this procedure saved a considerable amount of time in cell by cell coding and ediflng.. 3. Data Storage Single variable county files were formed after the first edit was performed by producing maps B-1 for each file and systematically comparing it with base data. It generally required three complete edits to produce clean single variable data files. -Multi variable county files were formed after so the final edit of the single variable files. Single variable files were combined to form a multi-variable file for each county in the state. In order to check for a direct overlay a special mapping routine was applied which would delineate valid data mismatches, drafting errors and invalid mismatches of data variables. Such errors generally resulted from poor grid alignment on the maps which were encoded at different base scales. Such errors once identified were easily corrected. -State-side files were created utilizing a different procedure because creation of the county files only required a simple combining of different data items the merging of counties required more sophisticated software insure proper spatial placement of adjoining but still separated map boundaries. Decisions concerning coding were required when adjoining data which would reside in the same cell once the state-wide file was formed.For example an overlapping cell could be encoding one way on one county map because of the dominant polygon and encoding something completely different on an adjoining county map because of the presence of a different dominant polygon. Figure 1 DIAGRAM OF A COMMON CELL OVERLAP PROBLEM County A County A Encoded Primary County B County A Encoded Secondary County B Encoded Primary County B Encoded Secondary B-12 A co ticl?oin;n- of bordering mputer program (MAPMER G E) was applied to permit the accurate data record.The running of this program for construction of the state file.1nitially involved the assembly of county files into, regional area files which corresponded to standard regional planning land use districts in the state.The regions were then merged into a fin al single state-side File containing: TABLE I Variable Variable Name Cof Digits I Grid cell row number 3 2 Grid cell column number 3 3 county number 2 4 surface water quality 2 5 engineering geology (primary) 2 6 engineering geology (secondary) 2 7 transportation facilities (primary) 2 8 transportation facilities (secondary) .2 9 ownership (primary) 5 10 ownership (secondary) 5 11 mineral resources 2 12- sewer and water districts 2 13 vegetation 2 14 soils (primary) 3 15 soils (secondary) 3 16 soils (tertiary) 3 17 natural features (primary) 3 18 natural features (secondary) .3 19 topographic slope (primary) I 20 topographic slope (secondary) 1 21 watersheds and sub-watersheds 4 22 electoral districts 2 .23 historic sites (first in cell) 6 24 historic sites (second in cell) 6 25 historic sites (third in cell) 6 26 land use (primary) 4 ZI land use (secondary) 4 28 tertiary land use (1973) 4 29 primary county comp plans 6 30 secondary county comp plans 6 31 tertiary county comp plans 6 105 B-1 -3 Data Retrieval, Analysis and Outputting The MAGI system was developed w th the capability of producing computer maps for editing data as well as Final display graphics,B Iecause of the large number of data varic ble classes for certain variables(i.e.,soils)it was impossible to graphically display all variations on a single mcp.For Final modelling and presentation it was decided that all maps would callapse sub-classes into ten or less final mapped variations@ An example of this procedure is the aggregation of thirty soils series into ten new soils classes based on agricultural productivity for modelling any information which required productivity data in the procedure. Most of the models developed as integral parts of the system were descriptions of the suitability and cupability of the land to support various land uses.The outputs of the various defining the models were generally series of grid maps graphically capabilities of the state for various potential activities. Each cell can be ma pped with a value which expresses weightings assigned to one or more characteristics' of the cell.Weightings were established by estimating constraints to and amenities for each data variable to all other variables and Finally relative to the specific activities being, modelled. System Use To date the system has been primari1y used within the state planning off'lce.However a number of specific projects have been able to incorporate system analysis and output nroducts into their overall frameworks.For example: I.Mapping mining and extraction potenfials@for the entire state, 2.1dentifying all agriculturally productive I'ands-both actual and potential. Undustrial location siting studies. 4.Mapping residential development suitability and capability for rapidly deve7loping areas of the state. 5. Preparation of -a state-wide comprehensive recreation plan B-1 4 DATA mm:,rt- ANALYSIS XNCODTNC LTDRARY PREPARATION DATA COMPUTER FIRCING FMA-1 CMUTFR VICATION & REFOPMATT INC EDITING REFEUNCINC OF I-IAHACE- SOMARE COMM TON LXYLE- @O.DZLS 'VELOMUr 07 DATA @M`TATIO's ACQUISITION SYSTIM HATERTALS RMT DL DOCUK211- VILES TESTING' TATION or SYSM Soils $2,600.00 $11,300.00 $13,835.00 VIA $6,000.00 $5,700.00 $1,485.00 Coolony 450.00 3,000.00 5,000-00 350.00 550.00 990.00 Slope 500.00 2,150.00 3,727.00 N/A 350.00 650.00 990.00 mineral Resources 450.00 1,000.00 1,160-00 N/A 350.00 450.00 220.00 Aquifers 400.00 1,500.00 1,000.00 N/A 350-00 400.00 220.00. Surface Waters 200.00 11100.00 1,150.00 N/A 300.00 300.00 990.00 Natural Ventures 1,200.00 4,300.00 2,160.00 $ 1,750.00 8,000.00 2,350.00 500..00 495.00 VeSeLation 300.00 4,500.00 3,757.00 N/A 700.00 650.0.0 Wildlife Habitat 350.00 2,000.00 700.00 650.00 N/A (Not Digitized) Sever & Water Service 500.00 1,600.00 1,010.00 N/A 1,000.06 .600.00 495.00 Transportation 200.00 2j800.00 11420.00 NIA 700,00 300.00 990 00 Public Land 500,00 2,300.00 1,031.06 1,200.00 1,000.00 200,00 500000 990.00 lligtoiic Sites 300.00 4,800.00 2,350.00 2j400.00 1,000.00 500400 300.00 990.00 Land 'Use .500.00 650,00. 990.00 .(1970) 500.00 1,000.00 2,568.00 N/A LAnd Use (1973) 200.00 30,650.00 Planned X/A 500400 200.00 990.00 ?long 300.00 2,700.00 Planned NA 500.00 500.00 Boundaries 300.00 2,250.00 1,750-00 N/A N/A 500.00", (Electoral Districts Wateraheda) Ron-variable Coots 2.250. 6,440-00 3,200.00 Y, 400. 9,250.00 08,950.00 $44.006.00 5,350.00, 412,300.00 $17 7 940.00 $ 10,835,00 $.3,200.00 $2,500-00 North Carol ina Land Resources Information System General Description The stated purpose of this developing system is to "provide local governments, state agencies and private citizens and businesseS with the information capabilities necessary for making land use decisions." Thus because this system will be used at various levels of operation and by numerous users for a variety of tasks system flexibility was the primary desing objective. A basic concept of the system is@to include both manual and automated techniques for data storage, retrieval and manipulatio@ . The manual aspects of the system involves, among other things a catalog of land related information. The automated aspect of the system will pro- vide for the capability for collecting data in computer readible form, storing and manipulating it with the computer and retrieving the information in plotted, printed, or tabular form. Exi sting computer facilities of the state Office of Management Systems.which consists of an IBM 370/158. will serve the bulk of the system. This facility is being used for large econo-- metric modeling, processing of large quantkties of data and when volumes demand, storage of data - Additional hardware was acquired t provide capabilities required for acquiring storing, 0 manipulating, and outputting geographically referenced, spatial information. Realizing the complexity of such a system and its extentive data base of land use and I nd related information the creation of the data base is proceeding incrementally. As land re- an source data is gathered by various agencies@and groups for their particular problem solving tasks the data base development will continue. In this way the amount of data handled will accumulate gradually being entered into the system as i'@f is captured. The collectors and users of the data will be the ones to enter it into the data base, In order to construct a data base@which contains accurate, useful and legally defensible i I data the data is collected and entered into the system in its most basic form. in this way, the accurracy and integrit-/ of the data base.can be maintained and flexible use is assurred. By storing spatial data as lines, points, and polygoms @information will be provided to the widest range of B-i 6 users. The system will tie to the State Plane Coordinates as the overall system of reference. in this way, ai I data stored in the system is geo-coded in a similar manner for the entire 5fafa. A Mrarchial grid structure is being utilized as a rrechanisn-, fcr ensur-.-,j compatibility and consis- tency for the collected data. One system will '!-@e usin,-, a ccrnpatible 'nierarchial structure of grid cell sizes, rancing from very small to very large. LRIS Hlistory Under the North Carolina Land Policy Act of 1974 a Land Policy Council was created and directed to carry out several mandates of which the primary one relates to the definition, preparation and maintenance of an information service for the land resources of I'he entire state. One Land Resources Information System. is entented to provide a system into which the land re- lated data collect ed by various agencies can be entered and s.tored in a consistent compatible way and be easily retrieved by various users. LRIS Descriptio The following list presents important elements of the system which is currently in the design, phase. 1) System data inputting. a. digitizing manually with interractive edit for capture of polygons. b. input data by coordinates c. input data in grid form d. a I I data con verted to State Coordinates 2) Data Editing a. interreactive b. machine data compaction and reduction routines 3) Retrieval. and Display a. print and plot source data b. retireval of digitized map B-1 7 c. recal I by map,. attribute, combinations of attributes d. recall by windowing 4) Hardware Capabilities a. disc storage b. drive digetizer and ploiter c. Fortran IV compilation d. hardwired floating poin ,t e. tape drivers 5) Software capabilities a. mathematical and statistical calculations - area, len gths, regression-, cross tabulation, proportions b. scale changes c. overlay grid and polygo i. n da ta d . perform Boolian combinations e. deal with island within polygons f. input, output and convert data in - UTM coordinates - State Plane Coordinates g. expandable file sizes h. expandable analytical cmabilities Projected System Use .I The State departments of N atural and Economic Resources, Environmental Manage- ment Division propose to use the system to digitize topography, climatalogic data and soils to plot this inform tion for overlay with land cover and use activity data for non-point source regional wastewater management. 2. Interactive plotting and analysis of LUDA data which will be available on tape in a computer ready format. 3. Historic Preservation - Stud of archaeologic sites, old roadway networks, and house Y 18 and barn types which will atlas and aid in design of predicture models relating historic site locations with known environmental and socio-economic conditions. 4. Resource inventory with inferractive overlay capabilities for overlay with various socio-economic data types. 5. Economic Development - with data available, analysis of locational characteristics and suitability and capability analyses will be undertaken. 6. Forestry - the analyses and plotting capabilities of the interractive graphics system wil I be used to quantify the type and extent of forest operations being carried out and to correlate these factors with the location of forest operations and determined in earlier data gathering. 7. The operational interactive graphics system will permit the review of land I water and air dumping permits in terms of impacts on areas of environmental concern, as the Coastal Zone Management Act of 1972 calls for a review of development permitting based on utilization of all possible resource and socio-economic data. types. System Program-and Costs The system once implemented will operate with a support staff of seven persons. Figure A presents the organizational diagram for this operational support staff.. Project Manager F Applications Applications Analyst Analyst Sysiems Analyst !Appl ica tions Proqrammer! Applications Programm- Represenatives Costs for-system development are presented below. Direct purchase of 165 equipment alternative was selected to augment .existing IBM 360 capabilities. A I tionsl ATP 1@,c a, B-19 COSTS Alternative 1: Direct!Purchase of IGS Equipment Calar-dar '11ear @1977 Jan. - Junei i July Dec. Jan. 1977 1977 1978 Canital Costs: Fiscal Year 1978 (Software & Hardware) Subtotal $130,000 Operational Costs: ,-Staff Salaries $4.4,406 $.44,406 .$44,406 -Staff Training and Travel 6,025 61025 22fice Miscellaneous: -Supplies 90 90 90 -Telephone 60 60 60 -Desks(6), Chairs(8) 1,200 Maintenance 7,500 7,500 7,500 Disk Work Packs 3,500 3,300 Plotter, Forms, Pens 150 150 150 Printer Forms 90 90 90 Physical Site Preparation 1,000 Subtotal $64,021 $61,821 $52,296 GR_LN'D TOTAL $214,021 FUNDING' Alternative 1: Direct Purchase of IGS Equipment Calendar Year 1977 Jan. - June July - Dec. Jam. - June 1977 1977 1978 Fiscal Year 1978 Funda for Capital Costs: Dept. of Administration 50X0 Research and Development . Land ?ali-y Counci2 10,000 -Research and Development Land-Classification MIM -203 50,000 Dept. of Cultural Res. 5,000 Subtotals $115,000 Funds for Operati nal Costs: Land.Policy Council- 13,000 25,000*. 25,000- 7,500 7,500 DNER -I--%-IMSAT Training 9,000 Parks Recreation 3,000 51000* 51000* Subtotal $ 25,000 $32,500 02,500 G,RAZM TOTALT $140,000 *expansionary budget **this funding is not assured at this time 40 B-27 IV The Montano Energy Resource Geographic Information System (ERGIS) I . General Description The general purpose of ERGIS development was the establishment of a geo-referenced data bank for planning land and resource utilization in, Montana. The system, more specifically, was designed to increase work efficiency and to add the capability of handling complex environ- mental data for resource management wi Ith the Montano Department of Energy Resources. It was decided early in the system develo ment that the focus would be on: a. converting all resource information into digital form for data manipulations; b. the conversion would be as autorAated as present technology would cost-effec- tively permit; c. constant and instantaneous revision and updating of the data base would be required; d. al I types of errors woo ld have to be greatly minim ized; and e. inventory maps should,be produced at any scale, by an area and by any combination of data elements. 2. ERGIS Histor The ERGIS system was developed in-house by systems analysists with the Energy Planning Division. Work was initiated in 1974 and parts of the system were operational by mid 1976. The system, is currently utilized and supervised by the Energy Planning Division but through cooperative agreements, a number of federal and Montana State agencies have.also used the system. 3. Specifics of the ERGIS s' An overall review of the ERGIS ystem and the inter-relationship among its subsystems is show in Figure #1 . The major subsystems are 1) input material analysis, 2) input device selection, 3) storage formats, 4) storage@ devices, 5) output formats and devices, and 6) mani-- pulation specifications. B-29- M IM @Now W MIMM !.,put giterial @Angq of %ra I e% tv.- i a I IDOVIce quantitiCation =CPI. and f -lu 1 7135 _0 T-1 =D"C anO -ITI-21P @Iizel J'@ctio qualification t A So-c Gr I d ".nq ............................ .... di on Age :."-tQri a) '"rint natio 4EEI nv. rs ion -Polyool PA tert ... .................. -7. ro M'., sols j ng W Figure I "0 - Ellv 11 'CC-Lt. oi -I. Data Input Subsection: For the system data bank, most input materials are in cartographic map formats. The system designers attempted a systematic quantification and qualification of all potential input materials in order to maximize eventual @system efficiency. After an extensive study, similar to the methodology proposed in the main body of this report, input data and devices were selected. The digitizing equipment consists of a Broomall GP-100 vaster scanner. This piece of equipment can scan documents up to bout 9-1 inches by 23 inches. The document is s-ccrined a 2 into a cellular format at a resolution of i@0, 100, 200 or 400 cells per inch. This scanner reads the document merely by shining a light on it and measuring the intensity of reflected light. Once scanned, the data is plotted and checked and then written to tape. This data can then be taken to the main computer for processing. (Fig. #2 presents the system hardware organization). Based on an extensive review of current data storage and retrieval techniques the micro cell system was selected. ERGIS researchers felt this offered the greatest efficiency of random accessibility, greatest flexibility of arbi frary output formats and the greatest data c om patibility. Some of the data items which have been input into the system include Physiography, Vegetation, Sediment Risks, Tree Size, Forest Stocking, Specially Managed Areas, Existing Land Use Patterns, Range Vegetation Types, Range Conditions and Regional Comprehensive Plan Maps. Analysis and Output: The ERGIS system has a number@of software packages for manipulation of digitized or scanned data. The packages are of two types: (1) software for cellular format processing, and (2) software for conversion to polygon format on which design has recently started. The software was written in FORTRAN with a minimum of machine-dependant instructions. Following is a brief explanation of the. major programs. (1) ACRE - calculates acres by counting cells. B-24 Versatec Matrix Plotter iD.C.C. i-Computer Broomall Ind. ,@right Ind. @(D-116) Tape Durive GP-100 as system Raster Scanner Controller v Tektronix- CRT & ..Control Console Figure 2. B-25 (2) AGGREGATE - reduces the number of cells on a map by aggregating numbers of cel Is into I arger single cells based on dominant types of the original cells. (3) CORRELATE - can composite up to five maps in a single computer run. The descriptors insert& d in a new map are determined by the descriptors of the old maps and thle way the user has set the program to run. (4) INPUT - can take dot 6 from other ceiluar mapping systems and since it uses 80 records it is very useful in accepting data punched on cards. (5) JOIN - is used to join maps together. Since the EPD's raster scanner can I scan maps at a maximum size of 9j- inches by 23 inches, biger maps must be 2 reduced or cut into smaller segments. Program JOIN either photograph ical Iy can join scanned segments into one map. (6) OVERLAY - is utilized in compositing if certain types of maps do not have descriptors or inventory patterns covering an entire study area. Thus, when one map is overlain on another, cells with a descriptor of 0 will not over- w 'te cells with non-z@ rodescriptors. rr (7) PRINT - sends a map file to a line printer. It prints each cell in a map as a character on a line printer. Up to triple printing is utilized. (8) RECOVER - will make' line maps from printer maps. (9) SREW - is used to extr act a piece of a map or to extend the sides of a map. Conclusions ERGIS has been established for u :se in land and resource management which is data manipulation related. It has been their fi riding that for a regional or statewide data bank with large amounts, of data that needs to be digitized, constantly maintained and updated with a minimum of errotl an automated digitizer was essential. The important capabilities of ERGIS are: 1) file separation and file mergence in order to achieve random output ability, 2) vectorization of scanner data which can be printed or plotted, and 3) conversion of a polygon data base to a fine-cell grid data base until polygon manipulation is deemed feasible. (No cost or time estimates were readily available for ERGIS). B-26 Uses of the System Some uses of the system have included- 1) The Montana Dept. of Community Affairs has been digitizing and manipulating data for a number of natural resource and socio-economic characteristics primarily for use by land use planners, expecial ly at the local level. The Dept. of State Lands.is using the system in mine siting studies. 3) The Dept. of Fish and Game is generating'computer maps for acreage calculations according to species habitat and ownership. 4) The Energy Planning Division is using the mapping capabilities in routing trans- mission lines. As well, they are using the system for monitoring pollution from power plants and other industrial sources. 5) A number of Indian tribes are investigating using the system for managing tribal lands. B-27 V. Texas Natural Resources Information System Introduction to TNRIS The goal of the Natural Resource s Information System for Texas is to facilitate the fulfillment of specific statutory responsibilities and administrative needs of State agencies that must plan, developm operate, man age and conserve the natural resources of Texas. The system was designed to provide a flexible mechanism for maximum availability of natural resources data/information, consistent with cost and efficiency, to state, federal, regional, local and private entities forlsupport of state resource programs such as water resource planning, coastal zone management, land use planning and energy conservation activities. The TNRIS is not only a system in traditional engineering terms. It is defined as a service mechanism for (1) assembling data in both machine processable, and non-machine processable form; (2) processing raw data into meaningful data; (3) adjusting and organizing processed data into forms and formats suited for modern storage retrieval and manipulation procedures; (4) storing these data in a systemized information basq (5) disseminating data from this base of information; and (6) mdnipulating this data into maps, graphsi models and study plans, and simulation systems@ needed to manage natural resources as determined by user requirements. The heart of the NRIS is an information base which was designed to meet the needs of the user agencies. Although comprehensive in nature and compatible withexisting and planned federal and state systems, the data base is not completely centralized. Each acency has continued to be responsible for maintaining irls own information files. NRIS organization closely follows the design of the United States Geologic Survey National Water Data Exchange (NAWDEX) System. NRIS and NAWDEX are actually a B-28 combination of the linked network organization approach and a hierarchal approach, where all user agencies make up the linked network and a 'Systems Central' provides a coordinating point of contact and gives a required hierarchy. The basicadvantage of this type of a system for Texas has been that user agencies continue to maintain internal data storage and retrieval systems for the data that they collect and use. Systems Central provides an interface between users so that data transferswhen required, are easy. The system also provides a central index available to all users, and is the processing site for external requests for use of the system. TNRIS History A TNRIS was established in.1972 but a n-umber of important events occurred previous to this. Primary, was the creation of a "centralized data bank incorporating hydrologic data gat.hered,in Texas" by the Texas Water Development Board in 1967. Then in 1971 an Inter- agency Council decided that this Texas Water Oriented Data Bank could serve as a foundation for a complete natural resources data system.. 1971 marked the completion of the natural resources data identification and.categorization project. At this time there was also.developed aNRIS conceptual design with recommendations for implementation. Subsequently, the system actually became active with the compilation of NRIS file descriptions and'a survey of NRIS data/linformation needs. Work on geographic data manipulation and analysis capabilities was initiated in 1974 and was operational by 1975. Specifics of the Texas NRIS. Basicto this system as to any large information system are three elements: (1) the users of the system, (2) the data base, arid (3) the information services which are provided. B-29 As the TNRIS was designed to serve a wide range of users (decision maker, planner, technician, research, and the general !public) the data base reflects this diversity. Though totally comprehensive and syste.matical@ly organized along defined lines, the data base is generally not centralized. Each agency maintains data bases which exist in both machine processable form. The following CaTegories and subcategories form the present TNRIS data base: 1. Base Data (i.e., @base maps, geodetic control). H. Meteorology. A. Climate B. Air Quality C. MarisActivities (i.e., permists, etc.) Ill. Biologi.c Kesources. A. Animals B. Plants C. Microrgan isms D. Mants Activities I V. Water Resources A. Surface B. Subsurface C. Man's Activities V. Geologic and Land Resources A. Surface B. Subsurface C. Man's Activities V1. Socio-economic Resources A. Social B-30 B. Economic Commerce D. Government E. Archeologic The information base has been implemented using standard codes and procedures designed to ease use of the syste m and for system compatibility. TNRIS services include providing information on data abailability, providing basic data and information retrieval, and providing a number of data analysis capabilities. Specifically the range of TNRIS services include: (1) Data Inventories (2) Data Reports (3) Technical Studies (4) Environmental Impact Assessment Procedures (5) Preparation of planning and resource management models (6) Project management (7) Assistance in project design, operation and control. The system, for geographically specific data such as soils and land use, uses a polygon mapping system. The system enters data based on the digitization of line segments for polygons, lines for linear data, and points for point data. The system is based on techniques proposed for the USGS, Digital Cartographic Data Base. The system has the capability to transform a file, between any two map projections defined to the syste, as well as, to geodetic coordinates. Presently availabh a projections are Mercator, UTM, State Lamberts Conformal Conic, Albers Equal Area and Orthographic. Within the system data processing involves the chaining of three data files. The largest is the chain file which lists X, Y coordinates and contains feature item coders for the left B- 31 and right sides of all line segments. The next files, the definition file, contains the fourt major data types: points, lines, area, and networks. The third file is an easily updateable index for all definitions and boundaries Standard color cartographic plotting and interactive analysis and display are system output modes. Analyses include overla of up to eight data sets, cross tabulation, area and y distance calculations and a number of s Pecific procedures such as variety analysis. Work is currently underway for incorporating U@NDSAT Digital imagery files and pattern recognition/ classification software into the system. TNRIS Summar It appears that the system has been able to accomplish three of its basic objectives: (1) makes data available f a 6sers, consistent with cost and efficiency, rather than just to source agencies; (2) it has minimized redundant data collection programs; and (3) provides data analysis @capabilities which would be beyond the resources of a number of user agencies. System implementation in Texas did require a close working relationship between the people who would be operating the system and system users. The system central is currently staffed by six persons who are a mixturelof computer analysts and natural resource planners and managers. (Exhibits 1 through 4 ill trated numbers aspects of system operation and us design). B-@112 /0- --@N Requester T WC GLO TFS TWOB TPW' TIC UTBEG .................... ....................... ......... **-,--",* 11 ....................... .................. ... . . ........... ............. ............ TRRC TWRC SYSTEMS CENTRAL TWDB TSHD ACIB. TDA THD LEGEND NRIS Agencies Agency Data/information Holdings 0 NRIS and WOOPS.Agencies 10 Flow of Data Request SYSTEMS CENTRAL Flow of Response to Data Request Exh i b it I NRIS Data and Information Handling Situations Request Situation I B-33 T GLO TFS TWOB TPW UTBEG TIC .............. ........... .......... TRRC TWRC SYSTEMS CENTRAL TWDB TSHD ACB THD TDA Outsid Request r LEGEND N R IS Agencies Agency Data/ Information Holdings 0 NRIS and WOOPS Agencies 0 Flow of Data Request . SYSTEMS CENTRAL Flow of Response to Data Request 0 T DPW Exh i bit 2 NRIS Data and Information Handling Situations Request Situation 2 B-3h TSWC GLO TFS TWOB TPW. SYSTEMS CENT RAL UTBEG TIC .......... ....................... I'll .................. ....................... ....................... .......... *.*.*'.* ... ...I ............ ...... ......... ................ .............. ........... TWRC TRRC TWDB TSHD ACB TDA THD Outside Requester LEGEND N R IS Agencies Agency Data/Information Holdings 0 NR IS and WODPS Agencies 10 Flow of Data Request 0 SYSTEMS CENTRAL --- //--* Flow of Response to Data Req uest Exhibit 3 Outside st 6Reque er NRIS Data and Information Handling Situations Request Situation 3 B-35 6. F, COMPLETION WODPS REPORT OF WODPS -0 TO iCNRE ICNRE CATALOGS ON NRIS NRISTASK FORCE 14 FISCAL-Y R,1472-,". IDENTIFICATION IN 11 AL PR GRESS TWOOB DESIGN AND CATEGORIZATION REPORT TURAL OFNA ON NRIS AND PROCESSING E 8.1,1973 RESOURCES DATA F I V t7 IF L 79 nw@ INITIAL INVENTORY EXPANDED 4RIS OF MACHINE CON TWOOS "SYSTEMS PROCESSABLE DESIGN C PROCESSING ENTRAL" NRIS REPORT STAFFING DATA -3 F1 ESTABLISHMENT EXPANDED OF INITIAL E 0 D C NRIS NRIS 0 v I C 0 CENTRALINDEX INITIAL U E R C M SYSTEMS CENTRAL" C R E 0 M NRI A A C U U REPORTING S AND T L T N N PROCESSING TECHNICAL SUPPORT I L I T 0 0 1 STAFF C DATA LIBRARY N N N A G T 0 N -T- INITIAL NRIS INVENTORIES - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - REPORTING EXPANDED NRIS INVENTORIES ----------------------- ------------------ L ----------------- Lj WORK COMPLETED TO DATE Exhibit 4 Schedule of NRIS Implementation B-36 VI Statewide Geographic Information Systems - Summary and Evaluation. A comparison of a number of existing and developing systems permits a number of observations concerning the State-of-the-art at' such systems. Data A Digitization is technically the major hinge in the information. flows of all systems served. The process of creating an error-free file is critical because current hardware and software cannot intelligently ignore or correct non-logical errors, The process found in all systems is essentially- a. pre-edit of graphic data b. digitizing C. correction of digitizing errors cl. structuring the data file B. The development of interactive data inputting capabilities is of considera ble aid in correct- ing digitizing errors. Also such capabilities permit browsing a data file (i.e., Montana, ERGIS). C. it has proven expensive to interface line and point data with cell or polygon data once a system has been designed, so file structure should be designed with the possibility of this interface in mind (MAGI, ERGIS). Informational Outputs A. User expectations of automated systems have been generally too high if users know nothing or little about specifics of the system being utilized or developed (LUN,R, MAGI). B- 37 B. Standard statistical. packages such 'as SPSS and BIOMED have not proved adequate to meet the needs of spatial data analysis. There are statistical methods to handle spatial data, but in general, they have not, been adequately developed within the context of existing systems. Such statistical procedures recognize the potential spatial dependance of one observation on another (LUNK).@ C. it is critical that all users understa!nd the manipulative techniques being utilized so system outputs can be easily used in user decision making. This is often not the case and many systems systems manipulative capabilities are not used or are used incorrectly. Many state systems have utilized go-between processes as a part of user education programs. In this process, system developers and sponsors act a sintermediaries between various users and the system, translating both general and speci fi cuser problems into terms to which the system can re- spond. Currently there are no explicit systematic mechanisms by which to determine the amount and types of increases, in userperformance by utilizing information that comes from the system. Often if there is no per ceivable change in decision making it can be fatal tc the continuity and survival of the information system. The systems reviewed have exhibited that there is a clear need for either the sponsors or designers of the system to take the initiative and actively advocate use, of the system. This can only be accomplished if the systems staff includes personnel cap@able of understanding user problems (all systems). System Management Institutional Framework A. With all systems there have been significant time delays in project approval, the purchase of hardware and software, and hiring and training staff to operate the systems. Generally, past system programs have not been structured to reduce the adverse effects of such delays (LUNR, MAGI). Bl-38 I B. Systems initially must win the support of high level decision makers. The main problem appears to be overcoming negative attitudes and misconceptions .,which exist concerning the use and utility of computers (NORTH CAROLINA). To accomplish this.. the following groups must be included in system design. a. the data gatherers; b. the systems designers; and c. the anticipated user group. These groups must collectively identify the data supplies, evaluate data supplies with respect to the needs of the user groups'. and solve the problems of data flow from the. supplies to the system. C. Documentation of every aspect of system development and operation or significant problems arise with respect to: 1) inevitable staff changes; 2) the transfer of the system or of particular aspects of the system from outside contractors to system sponsors. Also, staffing must recognize the following activities and stages in system development: 1) system conceptualization; 2) system debugging; and 3) operation, maintenance and updating. D. It is not proved satisfactory to rely heavily on universities or outsid e organizations such as consulting companies as places to build information systems even though the growing organization temporarily avoids staffing problems. E . There have been significant problems in maintaining fiscal continuity for the development of information systems, given that such development generally takes anywhere from 3 to 10 years to complete. B- 39 F. System publicity has proven a problem area because if development takes a number of years,it is necessary to report progress on a regular basis. Successful mechanisms for accomplishing this have been series of deliverable products integrated with user. education programs. The user education process must start at the current level of capability of the user rather than at the systems projected level . Education is not how to use, the system but rather how the users problems can be handled in quantitative terms using data. It probably should be assumed that the @user must be shown approaches to problem solving which are entirely new. It is impractical to assume user feedback unless it is part of user education for a user cannot provide valid feedback if his capabilities are not equal to the systems manipulative powers. Other systems reviewed but not detailed here include: The Minnesota Land Management Information System The Canadian Geographic Information System The Alabama Resource Information System The Hawaii Information System The Composite Mapping System of the Federation of Rocky Mountain States B--)L0 APPZIDIX C 3/24/78 A PROPOSED NATURAL RESOURCE INFORMATION PLANNING PROGRAM FOR MANE The basic purpose of this program is to provide an organized framework for coordinating. Natural Resource Information planning activities of State agencies and to coor dinate State activities with those atother levels of government. Implementation of this program should provide for efficient se of av ilable funding and substantially increase the usefulness of existing natural resource infor- mation. u a The.Prograrn would consist of six functional areas: 1. Indexing H. Collection Planning I If. . Collection IV. Storage and Retri.eval V. Interpretation and Analysis V1. Distribution 1,NDEXING Essential to a Natural Resource Information Planning Program is the ability to quickly and easily determine what natural resource information exists that is relavent to a given planning or manage- ment activity. To 'orovide such a determination, a central automated index of all existing natural resource information pertaining to Maine needs to be established and regularly updated. Once such an index was established, a number of special purpose tools to meet specific planning and manage- ment needs could be developed. These might include: a. Printouts containing all index entries for a specific geographic area (for example Regional Planning Commissions or Minor Civil Divisions) b. A document listing all available digital information files, or c. A document listing all spatially distributed resource information (similar to the Wisconsin Inventory of Land Resources Data). Such a document is currently being prepared in preliminary form by the State Planning Office. The index would be available for special purpose searches as well. For example when an agency -ironmental Impact Assessment for a project, a special searc, or contractor needed to prepare an Env could be.run that would identify all existing, relevant natura@ resource information. C-1 -2- Substantial progress towards establishing such an index has been made by TRIGOM (The Research Institute for the Gulf of Maine). Recentl y TRIGOM was funded by the State Planning Office to conduct a study of how the transfer of natural resources information from sources to users could be expedited.* This study recommended that: I . A natural resource information center be established within the Maine State Library. 2. The center prepare and admin ister an index of Maine natural resource infor- mat.ion. This index would consist of: a. Affiliation with the National Cartographic Information Center (NCIC) and use of the NCIC remote-sensing imagery index; and i b. The Maine Index (as proposed by TRIGOM) for non-i magery information. H. COLLECTION PLANNING A natural resource data collection planning process.would be established and an initial Natural Resource Data Collection Plan prepared for Land and Water Resources Council approval. The planning process would provide for periodic updating of the Data Collection Plan and certain of its component parts. Activities/Reports to be accomplished as part of the Natura I Resource Data Collection Plan would include: A. Inventory of Resource Information The existing natural resource data base would be inventoried and major items would be published in loose-bound form as a handbook.. The handbook would be updated annually by the State Planning Office and would be modeled upon the "Inventory of Wisconsin Land Resources Data". The handbook could be derived from information contained in the MAINE Index and updated through the Index. *Natural Resource Information Transfer Study, TRIGOM, January, 1978. Available upon request from the State Plannin'g Office. -3- B. Data Needs Assessment Existing and potential natural resource data users at state, regional, and local levels would be interviewed as part of a process to assess user needs. Survey results would be reviewed and a final data needs report prepared. The user needs assessment would be updated on a periodic basis.(every 3 - 5 years). C. When possible, planning reports would be prepared on major categories of natural resource. data being collected or under consideration for collection. These reports would investigate the relationship between the resource type and planning needs, and should present recommendations concerning whether or not data should be collected for planning purposes in that category, how the data should be collected (for example, at what scales and level of detail, and what kinds of units should be mapped, etc.),, and geographic priorities for collection. Ideally, these reports would be prepared prior to initiating new collection efforts. Categories of data to be considered for report, preparation include: I . Bedrock Geology (completed in draft form) 2. Surficial G.eology 3. Soi Is 4. Vegetation 5. Wildlife Resources 6. Estuarine Resources 7. 'Marine Resources. 8. Water Data (report in progress) D. Technical Reports A number of special-subject technical planning reports should be conducted as a prelude to preparation of the Data Collection Plan. Subjects of these reports would include: C-3 -4- 1 Standardized Base Mapping 2. Projected Data Outputs from Federal Programs through 1985 1 3. A Review of Selected S, ate Resource Information Planning Activities (recently completed by the Council of State Governments). E. Plan Formulation All reports would be reviewed by the Advisory Group prior to their completion. Report findings and recommendations would$e reviewed and used in preparing a first draft of the Data Collection Plan. The Draft Plan would be expected to state the following: 1 . Unmet Data Needs 2. Unmet Data Needs requiring State action for timely collection (including standards for collection of such data) 3. Priority for state collection efforts 4. Implementation Programl(5 Year) 0. Col lection b. Coordination 5. Research Needs Once priorities for State collection efforts had been set, special studies would be conducted to develop and review detailed specifications for high-priority data categories.. These specifications would include: coding, mapping unit composition and size, accuracy, methodology for collection and presentation, etc. The initial Draft Collection Plan would receive wide review. The Advisory Committee would forward a final draft to the La nd and Water Resources Council with recommenda- tions for implementation and other action as appropriate. Questions that cannot be re- solved within the Advisory Commirre e would be forwarded to the Council for resolution. The Data Collection Plan would be partially updated on an annual basis, and fully re- viewed and updated every fivc )rt;:Uf@. C-4 COLLECTION -5- The Data Collection Plan would be implemented through collection activities at all levels of government. - Collection activities by each level would be coordinated. in the following ways: A. Federal Collection Activity Federal collection activity would be monitored. The State would submit a statement of its priorities to the various federal collection agencies to pro vide maximum consistency with the Data Collection Plan. Formal liaison would be established between the Land and Water Resources Council and the federal collection programs via the Adv isory Com- mittee and the data planning coordinato,r,at the State Planning Office. B. State Collection Activity State agency collection acti vities would be coordinated through the collection planning process to provide consistency with the Collection Plan and to avoid duplication of effort. A data collection clearinghouse w ould be established in the State Planning Office. All state and regional agency proposed data collection activities.would be submitted to t e clearinghouse and reviewed'by the Council's Advisory Group. Comment. would be made concerning consistency with the Collection Plan. Agencies would not be required to be consistent with the Collection Plan, but the Advisory Committee could recommend that the Council review proposals where it, appeared that consistency could be achieved with- out extra cost and the agency inv-olved had not adequately made a case for lack of con- sistency. C. Regional Collection Activity The Land and Water Resources Council would circulate the Cal lection Plan and would re- quest that regional agency col lection activities be consistent with the plan where possible. Where regional collection activities were funded from State or Federal programs, collec- tion activities would b e subjec t to the same formal clearinghouse procedures as state agencies. D. Local Collection Activities Where appropriate,- Classification Codes, Standard Base Maps, etc. would be provided C -6- to communities engaged in local data collection. They would be encouraged to be con- sistent with provisions of the Collection Plan. IV. INTERPRETATION AND ANALYSIS A. Interpretation Reports would be published in handbook form to assist planners at all levels in using re- Source data. These handbooks would provide an indepth understanding of the data collec- ted, how it was collected andhowlit can be used for resource planning and management. Subjects for handbooks might include: I . Surficial Geology (in progress) 2. Wildlife Resources 3. Groundwater (in progre'ss) 4. Surface Water 5. Soils 6. Vegetation 7. Intertidal areas (in prog ress) 8. Marine Resources 9. Estuarine Resources B. Analzsis Included under this activity would be the Council's program for identification of areas suitable for development and gener I resource analysis activities of state agencies. It would not normally include specialized analysis, conducted by a single agency to meet inhouse needs that is not broadly useful to other agencies (for example, specialized wild.- life population analysis conducted by the Department of Inland Fisheries and Wildlife) 1 . Identification of areas suitable for development Resource characteristics, incl'uding those of geology, soils, and slopes would be analyzed and rated to guide selection of favorable sites. Where possible, standard analysis methodologies would,be developed and adopted by the Council so that indi- C-6 -7- vidual agencies conducted such analysis for specific program needs could contribute to the larger, statewide program with their efforts. 2. Other Analysis Activities As the State's resource data base is developed, substantially increased analysis activity can be anticipated. Where possible, methodologies for such analysis would be standardized, or at a minimum, reviewed and commented upon by the Advisory Group. Such activities might incl ude special siting studies, resource analysis to support policy studies, etc. V. STORAGE AND RETRIEVAL As long-term natural resource information needs begin to be classified through the process of developing a Natural Resource Data Collection Plan, an organized system of storage and retrieval of such data will need to be developed. Substantial progress in this category will likely have to wait until work in earlier categories is well underway. A number of special reports would be conducted to support deve lopment of an organized system for storage and retrieval. These would include: I . . A study of the feasibility of establishing a Geographic Information System at the State level for the storage, retrieval and analysis of specifically dis- tributed natural resource data. (complete in draft form) 2. A study to prepare recommendations concerning how to improve the distribution of existing natural resource information to existing and potential use rs. (completed) C-7 -8- A. DISTRIBUTION The final function of a complete information system is distribution of information to users. This function should include both the simpl distribution, as wel I as outreach activities to e i improve awareness of data, to point out how' available data can be used to solve problems, etc. A study has been conducted under the Coastal Program that is related to this function. This was the Natural Resource Information Transfer, which was completed by TRIGOM in January. C-8 APPENDIX D REFERENCES CITED Adams, V.W. Earth Science Data in Urban and Regional Information Systems - Baxter, F.P. and T.L. Cox. A Computer Assisted Land Management Decision System. Tennessee Valley Authority. Dec. 1975 Calkins, Hugh. "An Information System and Accountability Theory in Plan- ning. P.H.D. Dessertation, University of Washington, 1972. Crowder, R. "L.U.N.R.: Land Use and Natural Resource Inventory of New York: What it is and How it is to be used." Albany, Office of State Planning, 1972. Dueker, K.J. and R. Talcott. Statewide Land Use Analysis and Infor- mation Requirements. working Paper 13, Institute of Urban and Regional Research, University of Iowa. 1974 Durfee, R.C. ORRMIS: Oak Ridge Regional Modeling Information System. Oak Ridge, Tennessee. ORNL. 1974 International Geophysical Union. Second Interim Report on Digital Spacial Data Handling in the U.S. Geological Survey. IGU Commission on Geographical Data Sensing and Processing. Ottowa, Canada. 1976 Tomlinson, R.F. Geographic Data Handling: The Proceedings of the UNESCO/ IGU Second Symposium of Geographic Information Systems. Ottowa, Canada: IGU Committee on Geographic Data Sensing and Processing. 1972 U.S. Department of the Interier. Information/Data Handling: A Guidebook for Development of State Programs. U.S.D.I. R.A.L.I. Program, Washington, D.C. 1975. D-1 Wright, Linda. Maine Land Use Planners: A Survey of Resource Information Needs (draft). Orono, University of Maine, 1977. Murray, Timothy. Honey Hill: A Systems Analysis for Planning the Multiple Use of Controlled Water Areas. Alexandria, Virginia: U.S. Army Engineer Institute For Water Resources. 1971. Pencker, T.K., P.M. Mark, and J.L.P. falty Geopraphic Data Structures Cam- bridge. laboratory for Computer Graphics and Spatial Analysis. Howard. 1973. Schweitzer, R.H. Mapping Urban America with Automated Cartography. Washing- ton, D.C. Geography Division, Bureau of the Census. 1973. Steinitz Rogers Associates, Inc. The Santa Ana River Basin: An Example of the use of Computer Graphics in Regional Plan Evaluation. Campbridge, Massachusetts. 1975. Talcott, Richard ana Kenneth Dueker. Government Policy and Decision Making Implications for Land Resource Information System Design. The Institute of Urban and Regional Research. University of Iowa. 1974. Texas Natural Resource Information System File Description, Jan. 1974. D-2 I r V -i I I., I I i I I i I I 1 4 1 I u 1, .liallmliffill" u 3 6668 14101 8533