Monthly Archives: December 2013
Plotting data across time helps to reveal interesting patterns and relationships. This was true of a study of weather and temperature patterns in Illinois that was conducted by Carl von Ende from the Department of Biological Sciences at Northern Illinois University.
The goal was to visually compare the trends in temperate between two years, 2008 and 2009. The data, available from the National Weather Service Forecast Office (of) Central Illinois, gives temperatures (measured in Fahrenheit) and dates when the measurements were taken.
In this article, we will explore the steps needed to create such a plot, including the creation of new variables, using time and date functions and graph customization tools.
The data used to create the graph is presented below. To access this data set directly so you can perform this excercise yourself, please download the .STA file from here.
As you can see, there is only one date variable, and we need to obtain from this variable the year for categorization as well as the day of the year for plotting along an axis. Therefore, the first step is creating two variables with this data.
In STATISTICA, select the Data tab. In the Variables group, click the Variables arrow, and from the menu, select Add. In the Add Variables dialog box, add two variables after Temperature (in F).
After you click OK, two new variables are added to the data file, NewVar1 and NewVar2.
Double-click on the variable header for NewVar1 to display the variable specifications dialog box. Change the variable name to Day_of_Year. In the Long name field, enter a function that returns a numerical code for the day of the year. You can see this function and the required parameters by clicking the Functions button, which displays the Function Browser. In the Category list, click on Date/Time, and in the Item list, scroll down to and click on DTDAYOFYEAR.
This is the function that will be used to bring back a numerical code for the day of the year. As you can see from the description of the function, it returns a numerical code between 1 and 366 that represents the day of the year. Close the function browser and return to the variable specification dialog. In the Long name field, type =DTDayOfYear(‘Date’).
When you click OK, a message will be displayed, letting you know whether the expression is correct. If the expression is correct, click Yes and the variable will be renamed and numerical codes for the day of the year will be included in the cases for the variable.
Now, double-click on the variable header for NewVar2, rename the variable Year, and in the Long name field, type =DTYEAR(‘Date’). This will rename the variable and add a four-digit number for the year in each case of the data set.
You will now have the complete data set with two additional variables, Day_of_Year and Year. These variables will be used in the results graph.
To create the graph, select the Graphs tab. In the Common group, click Scatterplot.
STATISTICA is designed so that when creating a 2D Scatterplot, the most common options for creating a scatterplot are shown on the Quick tab, as shown below. On the Quick tab of the 2D Scatterplots dialog box, click the Variables button. Select Day_of_Year as the x-axis variable and Temperature (in F) as the y-axis variable. Click OK in the variable selection dialog box. Under Fit type, clear the Linear check box.
On the Categorized tab, in the X-Categories group box, select the On check box. Click the Change Variable button, and select Year as the categorization variable. Click OK. In the Layout group box, select the Overlaid option button.
Click OK to create the scatterplot graph.
We still need to convert the numerical codes to date format and to connect the data points by lines. To do this, double-click in the graph background to display the Graph Options dialog box.
Select the General tab for Plot, and select the Multiple lines check box.
Select the Scale Values tab for Axis, and ensure that the X Axis is specified. Then, for the Value format, select Date and the option for 17-Mar. Finally, under Options in the Layout drop-down list, select Perpendicular.
Click OK to create a graph that shows the data for Day_of_Year categorized by Year for Temperature (in F), as shown below.
STATISTICA Enterprise™ Selected by Toshiba Electronic Engineering Corporation for Advanced Analytics for Manufacturing Support
TULSA, OK, USA [Dec. 11, 2013] StatSoft’s STATISTICA Enterprise Platform will be deployed by Toshiba Electronic Engineering Corporation, a subsidiary of Toshiba (TYO: 6502 and NASDAQ: TOSBF), as the solution of choice for advanced analytics and data mining methods for manufacturing support.
StatSoft’s STATISTICA Enterprise is a comprehensive platform that delivers a range of analytic capabilities to support modern automated manufacturing enterprises, with methods ranging from basic control charting to the application of advanced data mining and pattern recognition algorithms.
“We are excited to have this opportunity to deploy our analytics platform to Toshiba Electronic Engineering Corporation and its many customers, and to support Toshiba’s continuous drive towards manufacturing excellence and quality,” said Win Noren, Vice President for Global Operations at StatSoft. “With StatSoft’s corporate mission of ‘Making the World More Productive,’ we are a great partner for Toshiba, whose products touch the lives of so many people worldwide.”
Adds Shizuo Sawada, President, Toshiba Electronic Engineering Corporation: “The selection of StatSoft provides Toshiba with a modern, proven, and comprehensive software platform that supports even the most advanced analytics in a way that is easily automated. This is critical, as our state-of-the-art manufacturing facilities rely on robust automation to deliver the highest quality products.”
About Toshiba Corporation
Toshiba is a world-leading diversified manufacturer, solutions provider, and marketer of advanced electronic and electrical products and systems. Toshiba Group brings innovation and imagination to a wide range of businesses: digital products, including LCD TVs, notebook PCs, retail solutions and MFPs; electronic devices, including semiconductors, storage products, and materials; industrial and social infrastructure systems, including power generation systems, smart community solutions, medical systems, and escalators & elevators; and home appliances. Toshiba was founded in 1875 and today operates a global network of more than 590 consolidated companies, with 206,000 employees worldwide and annual sales surpassing 5.8 trillion yen (US $61 billion). www.toshiba.co.jp/index.htm
About StatSoft, Inc.
StatSoft was founded in 1984 and is now one of the world’s largest providers of analytics software, with 30 offices around the globe and more than one million users of STATISTICA software. StatSoft’s solutions enjoy an extremely high level of user satisfaction across industries, as demonstrated in the unprecedented record of top ratings in practically all published reviews and large, independent surveys of analytics users worldwide. With its comprehensive suite of STATISTICA solutions for a wide variety of industries, StatSoft is a trusted partner of the world’s largest organizations and businesses (including most of the Fortune 500 companies), providing mission-critical applications that help them increase productivity, control risk, reduce waste, streamline operations, achieve regulatory compliance, and protect the environment.
StatSoft® is a registered trademark of StatSoft, Inc. All rights reserved.
StatSoft Solutions for the Healthcare Industry
Community Hospitals / Research Hospitals
As many hospital organizations have embraced the measurement and analysis techniques provided in the Six Sigma approach, STATISTICA serves as the analytics platform for Six Sigma programs and implementations of any size. Six Sigma’s emphasis on measurement and analysis requires a full-featured statistical analysis software system. STATISTICA provides all necessary analytic tools to empower the Six Sigma Green Belts, Black Belts and Master Black Belts with the analytic tools to explore data, determine the most important factors, and perform data-driven decision-making. These tools are integrated with the Hospital Information Management System to monitor and evaluate trends and performance in critical business processes and parameters such as lengths of stays, frequencies of diagnoses across time, breakdowns of diagnoses across hospital sites, billing delays, and times until receipt of payment, etc.
STATISTICA provides all of the analytic tools to support researchers in their investigations, from exploratory graphical tools, comparisons of groups, and data reduction techniques (e.g., Principal Components, Factor Analysis) to Report templating, generation (batch or user-initiated) and automatic distribution to a wider audience.
STATISTICA provides the analytics platform to automate the analyses necessary to monitor claims. STATISTICA provides a robust platform for generating Reports summarizing historical trends and new developments in type of claims, breakdowns by geography, etc. STATISTICA‘s comprehensive set of data mining methods are deployed as robust detectors of outlier and potential fraudulent claims.