|
OR/MS Today - August 2006 Forecasting Software Survey Forecasting Helping to fine-tuneyour choices. By Jack Yurkiewicz The role of forecasting in business, medicine or life is demanding and elusive. Executives at Warner Brothers had box-office figures in mind for "Superman Returns," but some "Pirates" may have hijacked a few of those dollars. ABC-Disney may have been pleasantly surprised by the ratings of the recent Mavericks-Heat NBA finals based on previous numbers and the host markets. Monitoring my own LDL values is much more than just an academic exercise. The examples can go on and on. When the results are found, the practitioner does an investigation of what went wrong or right. The methodology varies, from a "seat-of-the-pants" judgment guess, to an analysis of historical data using some forecasting software. If you are doing the latter, this sixth biennial edition of OR/MS Today's forecasting product survey may help you choose that software. The forecaster who needs forecasting software has different choices from those of just a few years ago. The dealer market is segmented into two groups: those who sell stand-alone, dedicated forecasting software (e.g., Forecast Pro) and those who offer a general statistical analysis program (e.g., SAS, Minitab, NCSS, Systat, etc.) that includes forecasting techniques. The former group is smaller and says it caters to different and additional needs of users who do forecasting and might not be satisfied with the features and capabilities of the latter group. One company, SPSS, with its Trends module, says it falls into both groups. In the previous survey, I mentioned a "newer branch," programs that are add-ins to Microsoft Excel. This branch has grown; typically these are statistical offerings (e.g., StatTools) with forecasting features, but are less comprehensive than the stand-alone general statistical programs. As in previous surveys, I have broken down the forecasting software into three groups: designated automatic, semiautomatic and manual. Automatic software performs some diagnostic test on the data and then recommends the forecasting technique. It then optimizes the parameters of that technique by minimizing some statistic and gives the forecasts, plot and summary statistical measures. For example, using a data set that tracked motion picture box-office returns, Forecast Pro, an automatic program, working in its "Expert" mode, gave the results shown in Figures 1 and 2.
The software also gave a graph including 95 percent confidence intervals for the forecasts, various "goodness of fit" statistics [mean absolute percentage error (MAPE), mean absolute deviation (MAD), Box-Ljung, Bayesian Information Criterion (BIC) and others] and forecasts for the periods I specified. Other automatic products offer similar output. Of course, with automatic software, the user can veto the recommended technique and manually specify the forecasting methodology. Semiautomatic software asks the user to specify the forecasting methodology for the input data, and the program then finds the optimal parameters by minimizing some statistic. For example, Figure 3 shows the dialog box from NCSS, a general statistical analysis program that falls into the semiautomatic group. Using the same box-office data, which had both trend and seasonality, I chose that (Trend-Season or Winters') method. The program permits work in the manual mode if I enter the three smoothing parameters, or offers to find the optimal ones if I choose a specific search method (e.g., minimize mean square error). The software then gives optimal parameters, the resulting forecasts, graphs and various statistical measures.
Finally, manual software asks the user to choose the model and the parameters for that model. For example, Figure 4 shows the dialog box from Systat, a general statistical analysis program that falls into the manual group. The user chooses exponential smoothing as the methodology, and the dialog box asks for the data and values for the three smoothing parameters. The software then gives the graph, forecasts and the accompanying statistics.
All three groups have certain advantages but come with pitfalls. The automatic products may allow the novice or inexperienced user to make forecasts, but could lull some users into using the program as a crutch. For example, when I recently asked a particular practitioner why he chose the Box-Jenkins model, he responded, "The computer told me." Manual software gives the user complete control of the forecasting process, but finding the correct parameters of the model can be a tedious trial-and-error procedure. The software should allow the user to withhold some portion of the data and find a fit for the rest. Many programs do not permit this. The software output can vary widely. Some products just give the forecasts, a graph of the data with the forecasts and a few statistical results. For example, almost all will give the MSE or the MDD, but far fewer give the Akaike Information Criterion (AIC), Ljung-Box statistic or the BIC. Experienced forecasters may want statistical tests on the within-sample errors, and, after the model is found, may also want the statistics for the out-of-sample errors. Many products will not do this. The user should be aware of the nature of the time-series that he or she is analyzing. It may be too small for a particular procedure or have missing values that may preclude using the software at all. It may exhibit extreme volatility or have statistical outliers that could fool some programs, yielding poor forecasts. If the data does exhibit seasonality, the user should know the periodicity. Some products make this "pre-analysis" easier than others. All programs will give a time plot of the data so the user can visually detect anomalies, while a few offer warnings and suggestions. One item that is difficult to ascertain is the "ease-of-use" of the software. If you use the product all the time, you may become acclimated to the software's idiosyncrasies and your "complaint level" may decrease. However, the occasional user may eventually stop using the product if the drawbacks outweigh the benefits. Since our last survey, some products that I tried and that previously exceeded my "complaint-level bar" have improved greatly. For example, all the products I tried imported my Excel 11 files seamlessly, and the operation was intuitive (sorry, I read the manual only as a last resort). My biggest complaint with some products is that I could not get the time-forecast plot into Excel for enhancement. With other programs, I found that saving the graphics output and trying to import it into Word became cumbersome at least and impossible at worst; the copy-paste sequence was the only solution. One issue that is still open from our last survey is "duplication." By this I mean, if you and I both choose the same model with the same data, will two automatic forecasting programs give the same results (i.e., yield the same model, identical parameters for the model and the same forecasts)? Invariably, the answer is "no." The reasons vary, from which statistic the software (e.g., MSE or BIC) is trying to minimize to the number of data points the software is using to start the procedure (the initial conditions). I have found that even when I tried this test on pairs of semiautomatic or manual software, and I worked in the manual mode (i.e., choosing the model and specifying the parameters), the resulting forecasts differed, probably because the products differed on their choice of initial conditions. The documentation frequently gave no clue as to how the software made its calculations, and the program often did not allow the user a choice on which statistic it should monitor. Based on reader and vendor feedback, we have tried to make this edition as comprehensive as possible. Still, finding all available products on the market is problematical. So for those users and dealers who feel slighted by our omission, please accept our apology, let us know of your existence, and we will try our best to include you in our next roundup. Even if the program does have the techniques and the level of automation you require, some features may vary. For example, Minitab finds the optimal exponential smoothing parameters for Holt's method, but not for Winters' method. Some programs limit the number of observations. A program might do Winters' method, but may permit only linear but not curvilinear trend, and the model may only be multiplicative, but not additive. Vendor support is important. Is it free for some period? Is it "live?" That is, do you actually get to speak with another person, or is support limited to e-mail exchanges? Check to see if there are free patches for the current and previous versions of the software. Before you buy, call the vendor and ask questions. My experiences and feedback from other users have shown vendors to be uniformly helpful and eager to assist you in making the correct purchase for your needs.
OR/MS Today copyright © 2006 by the Institute for Operations Research and the Management Sciences. All rights reserved. Lionheart Publishing, Inc. 506 Roswell Rd., Suite 220, Marietta, GA 30060 USA Phone: 770-431-0867 | Fax: 770-432-6969 E-mail: lpi@lionhrtpub.com URL: http://www.lionhrtpub.com Web Site © Copyright 2006 by Lionheart Publishing, Inc. All rights reserved. |