SciELO - Scientific Electronic Library Online

 
vol.29 suppl.1Flours and starches made from cassava (yuca), yam, sweet potatoes and ñampi: functional properties and possible applications in the food industry índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay articulos similaresSimilares en SciELO

Compartir


Revista Tecnología en Marcha

versión On-line ISSN 0379-3982versión impresa ISSN 0379-3982

Tecnología en Marcha vol.29  supl.1 Cartago feb. 2016

http://dx.doi.org/10.18845/tm.v29i5.2520 

Artículo

HidroEsta, software for hydrological calculations

Máximo Villón-Béjar1 

1Telephone: (506) 550-2595. Agricultural Engineer, specialist in Soil and Water Resources Engineering, and Informatics. Professor, School of Agricultural Engineering. Instituto Tecnológico de Costa Rica. Cartago, Costa Rica. E-mail: mvillon@itcr.ac.cr

Abstract

[11]

This investigation, “HidroEsta, software for Hydrological Calculations” is oriented towards the creation of a computing tool using Visual Basic intended to facilitate and simplify difficult calculations that must be carried out in hydrological studies.

[12]

The software allows making calculations of statistical parameters, as well as linear and nonlinear, simple and multiple regression calculations, and polynomial regressions. It is also useful for evaluating if a series of data can be fitted to a series of distributions, and calculating design events with a given probability of occurrence, based on a seasonal variation curve or a duration curve; analyzing a storm and calculating maximum intensities based on pluviograms data; calculating flow measurements taken with windmills or current meters; calculating maximum flows with empirical and statistical methods; calculating evapotranspiration and hydric balance.

[13]

Several numeric methods were used to solve the equations, selecting the most appropriate one for each case. The results of this work provide civil, agricultural engineers, and agronomists, hydrologists and other specialists who work in this field with a tool that enableZs them to carry out calculations, rapid simulations, and determine flow or precipitation for designs.

Keywords: Precipitation; flows; random variables; distributions; statistical parameters; mean; variance; hydrology

I. Introduction

Hydrological studies require analyses of large amounts of hydrometeorological information related to precipitation, flows, temperature, and evaporation, among others.

The data collected represent raw information, but if properly organized and analyzed, they provide hydrologists with a very useful tool which allows them to make decisions in the design of hydraulic structures [5].

To make the calculations, hydrologists have to confront a series of problems:

  • Processing the information is highly laborious.

  • In most cases, the equations to be solved are very complex and their solution requires using numerical methods.

  • Simulations carried out manually are time-consuming, due to the calculations required.

Due to the complexity of information processing and of the calculations, errors can occur, making it useful to have software that provide hydrologists with a tool that enables them to simplify all these processes, and even to simulate their results, allowing them to optimize their design.

Objective

Create software which will facilitate and simplify complex calculations, and the analysis of the abundant information required in hydrological studies.

II. Theoretical basis

Random variables

A random variable [1], is a function X, defined on a sample space S, which assigns a value to the variable for each point (or each result) of the sample space of an experiment.

Random variables are also known as stochastic variables whose values are real numbers which cannot be predicted with certainty before the phenomenon occurs; i.e., they occur randomly.

Discreet random variable

A random variable X is discrete when its values are restricted to a finite or infinite enumerable set.

Example: number of rainy days that occurred during the months of any year.

Continuous random variable

A random variable X is continuous when its values are within a continuous range, and may be represented by any whole or decimal number.

Example: the daily flow registered in a measuring station.

Distributions

The behavior of a random variable is described through its probability law, which may in turn be described in several ways. The most common is through the distribution of probabilities of the random variable [4].

Notation:

  • X⇒ random variable of the function

  • x⇒ specific value assumed by the random variable

  • f(x) ⇒ probability density function (PDF) (probability function, probability distribution of x)

  • F(x) ⇒ cumulative function (cumulative distribution function, CDF)

Distribution measurements

Arithmetic mean

Given the composite sample of n data x 1, x 2, ..., x n, the mean is defined as their algebraic sum, divided by the number of data. When the average is calculated for a population, it is denotedby µ, and when defined for a sample, it is denoted by x.

Mathematically speaking, the average of data that are not grouped is represented by:

where:

m = population mean

x = sample mean

xi = ith value of the mean

xii n = number of data of the sample or population

The median

The median is a single value of a set of data which measures the central element in the data [5]. This single element of the data is the closest to the middle, or the most central value of a set of numbers. Half of the elements are above this point, and the other half are below it.

If x 1 , x 2 , x 3 , ..., x n are data organized in increasing or decreasing order, and n is an odd number, the median (Med) is the datum at the center, i.e.:

Med =, for an odd number n

if n is an even number, the median is the average of the central numbers, i.e.:

, for an even number n

The mode

Is the value that occurs most frequently in a set of data, denoted by Mo.

For data grouped in class intervals, once the modal class is defined, the mode is calculated using the following equation:

where:

Mo = mode

Lm = lower limit of the modal class

d1= difference between the frequency of the modal and pre-modal class (the class before the modal class)

d2= difference between the frequency of the modal and post-modal class (the class after the modal class)

w = range of the class interval

In general, the modal class is that which has the maximum frequency.

Variance

The variance of the population (σ2), is defined as the sum of squares of the deviations of the data with respect to the mean, divided by the total number of data, i.e.,:

The variance of the sample (S²) [5], is obtained by dividing the sum of squares of observations of the data with respect to the mean by the total number of observations minus one, i.e.:

Standard deviation

The standard deviation is defined as the positive square root of the variance, i.e.:

(for the population)

(for the sample)

Coefficient of variation

This is a relative measurement of dispersion, which is related to the standard deviation and the mean, i.e.:

Bias

The bias is the statistics that measures symmetry and asymmetry [2].

The bias (γ) for data from the population is obtained with the following equation:

where:

The bias of data in the sample is obtained with:

Cs =

where:

Kurtosis

For data of the population the kurtosis coefficient (k), is defined by the following equation:

where:

The kurtosis coefficient for data in the sample [2], is defined as:

where:

Estimation of parameters

The parameters of a theoretical distribution are variables that have a defined value for each set of data. Once the parameters are defined, the theoretical distribution is also defined.

In general, a density function or a cumulative distribution function may be expressed as a function of the random variable, and in general as a function of its parameters. For instance, the density function of the normal distribution of random variable X, is:

where:

µ = location parameter

σ²= scale parameter

To define the function f(x), the parameters µ and σ² must be calculated.

Since all the values of the random variable are usually unknown, the estimation of the parameters is made based on a sample [4].

Methods for the estimation of parameters

To determine the numeric values of the theoretical distribution parameters based on data in the sample, several estimation methods are used [5]; in order of ascending efficiency, they are:

  • Graphic

  • Method of least squares

  • Method of moments (MOM) estimation

  • Maximum likelihood

Goodness-of-fit tests

The goodness-of-fit tests consist in verifying, graphically and statistically, if the empirical frequency of the series analyzed fits a previously selected theoretical probability function with parameters estimated based on values of the sample.

The purpose of statistical tests [4], is to measure the certainty obtained when a statistical hypothesis is made for a population, i.e., to evaluate the assumption that a random variable will be distributed according to a certain probability function.

The most used goodness-of-fit tests are:

  • Graphical fit

  • Statistical fit

Theoretical distributions

Hydrologists usually have a record of hydro-meteorological data available (precipitation, flows, evapotranspiration, temperatures, etc.). Based on their knowledge of the physical problem, they will choose a probabilistic model to use which will satisfactorily represent the behavior of a variable.

Normal or Gaussian distribution

A random variable X, is said to have a normal distribution [2], if its density function is:

or

for -∞< x <∞

where:

f(x) = normal density function of variable x x = independent variable

X = location parameter, equal to the arithmetic mean of x

S = scale parameter, equal to the standard deviation of x

EXP = exponential function with base e (napierian logarithms).

When the random variable X, is normally distributed with mean σ = X and variance (σ2 = S2), it is denoted as:

X ~ N ( X , S2)

Parameter log-normal distribution

The random variable X, is positive and the lower limit xo does not appear.

The random variable: Y = lnX, is normally distributed with mean γ y and variance σ2y.

These parameters are used to specify that the distribution is logarithmic, given that the mean and variance of X may also be used.

It is said that a random variable X, has a 2-parameter log-normal distribution [5], if its density function is:

for 0 < x <∞

Where µy, σy, are the media and the standard deviation of the natural logarithms of x, i.e., of lnx, and represent respectively the scale and the shape parameters of the distribution.

Parameter gamma distribution

It is said that a random variable X, has a 2-parameter gamma distribution [5], if its probability density function is:

for:

0 ≤ x <∞ 0 <γ<∞

0 < ß <∞

Where:

γ= shape parameter (+) ß = scale parameter (+)

Γ(γ) = complete gamma function, defined as:

which converges if γ> 0

Log-Pearson type III distribution

It is said that a random variable X, has a log-Pearson type II [5] distribution if its probability density function is:

for: xo ≤ x <∞

-∞< xo<∞

0 < ß <∞

0 <γ<∞ where:

x o = position parameter ß = scale parameter

γ = shape parameter

Gumbel distribution

The Gumbel distribution [5], is one of the extreme value distributions, also known as the Extreme Value Type I, Type I Fisher-Tippett or double exponential distribution.

The cumulative distribution function of the Gumbel distribution, has the form: F(x) = EXP(-EXP(-(x - µ) /α)) or

for: -∞< x <∞

where:

0 <α<∞, is the scale parameter

-∞< µ <∞, is the position parameter, also called the central value or mode

Deriving the cumulative distribution function with respect to x, the density function of probability is obtained, i.e.:

or

for: -∞< x <∞

Log-Gumbel distribution

The cumulative distribution function of the Gumbel distribution [5], has the form:

for: -∞< x <∞ where:

0 <α<∞, is the scale parameter

- ∞< µ <∞, is the position parameter, also called the central value or mode

If the variable x is replaced by lnx in the equation the cumulative function of the log-Gumbel distribution, or Fréchet distribution, is obtained.

Hydrometry is the branch of Hydrology that studies runoff measurement. Another term commonly used for the same purpose is flow measurement. To estimate a flow is to determine by measurement the flow that passes by a given section at a given moment [6].

Maximum flows To design [6]:

  • The dimensions of a channel

  • Drainage system

  • agricultural

  • airport

  • city

  • road

  • Channeling walls to protect cities and plantations

  • Sewers

  • Emergency spillways

  • Bridge span

The design flow must be calculated or estimated, which in these cases are the maximum flows.

Some methods to calculate the maximum flow are:

  • Direct method

  • Empirical methods

  • Curve number method

  • Statistical methods

  • Hydrological methods

Evapotranspiration

These are the total losses: evaporation from the evaporating surface (soil and water) + plant transpiration.

The term potential evapotranspiration [6] was introduced by Thornthwaite, and is defined as the total loss of water that would occur if there was never a lack of water in the soil to be used by vegetation.

Consumptive use is defined as the sum of evapotranspiration and water directly used to build plant tissue.

Since the amount of water used to build tissues is insignificant compared to that consumed in evapotranspiration, it may be said that:

Consumptive use » evapotranspiration

In irrigation projects, it is important to make initial calculations of the water required by crops. These water needs, which will be met through irrigation, are the evapotranspiration or consumptive use.

Methods based on meteorological data are used to calculate these amounts of water, of which the most well known are the Thornthwaite and Blaney -Clidde methods.

Materials and methods

Visual Basic, version 6.0 was used to develop the software, which permits the creation of 32-bit applications.

Numerical methods are be used to solve the equations, such as:

  • Newton Raphson algorithm

  • Secant method

  • Graphical integration

  • Lagrange interpolation

  • Romberg’s algorithm

To calculate theoretical cumulative distributions such as the Normal, Log-Normal, and Gamma, and as to estimate their inverse values, series development is used.

Many solutions of the equations are found using nomograms and tables. In the process of computation, they should be transformed into equations; the following will be used to determine the most appropriate equations:

  • Simple linear correlations

  • Simple non-linear correlations

  • Multiple linear correlations

  • Multiple non-linear correlations

Results

The product of this work is HidroEsta. The following figure shows one of its screens, with the calculation of the normal distribution:

Discussion

The results calculated using the application are in all cases more approximate than those obtained with nomograms.

HidroEsta represents a contribution to the simplification of hydrological studies. It is important because:

  • It provides an innovative and easy-to use tool for civil engineers, agricultural engineers, agronomical engineers, and other specialists who work in fields related to hydrological studies.

  • It simplifies processing of abundant information and complicated calculations.

  • Based on the information obtained, it allows simulation of design parameters for structures to be constructed.

  • It greatly reduces calculation time.

  • It permits obtaining optimal and economical designs.

Acknowledgement

Special thanks to the Vice-rectory of Research and Expansion of the Instituto Tecnológico de Costa Rica, which provided the time and material necessary to carry out this work.

Bibliographic references

[1 ] Haan, Ch.Statistical methods in hydrology. Iowa, State University Press, U.S.A., 1977. [ Links ]

[2 ] Kite, G.W.Frequency and risk analyses in hydrology. Water Resources Publications, Michigan, U.S.A.1977. [ Links ]

[3] Maidment, David.Handbook of hydrology.McGraw-Hill:New York - USA. 1993. [ Links ]

[4] Villón, Máximo. Diseño de capacidad de embalses por el método experimental - teoría de range. Tesis para optar por el grado de Magíster Scientiae en Ingeniería de Recursos de Agua y Tierra. Universidad Nacional Agraria “La Molina”. Lima-Perú, 1983. [ Links ]

[5] Villón, Máximo. Hidrología estadística. Taller de Publicaciones Instituto Tecnológico de Costa Rica:Cartago - Costa Rica. 2001. [ Links ]

[6] Villón, Máximo. Hidrología. Taller de Publicaciones. Instituto Tecnológico de Costa Rica:Cartago - Costa Rica. 2002. [ Links ]

Creative Commons License This is an open-access article distributed under the terms of the Creative Commons Attribution License