<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Pyzdek Institute &#187; Statistical Tools for Six Sigma</title>
	<atom:link href="http://www.sixsigmatraining.org/category/statistical-tools-for-six-sigma/feed" rel="self" type="application/rss+xml" />
	<link>http://www.sixsigmatraining.org</link>
	<description>Lean Six Sigma Training and Information</description>
	<lastBuildDate>Thu, 02 Feb 2012 17:43:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Where Do Those Six Sigma Statistics Come From?</title>
		<link>http://www.sixsigmatraining.org/introduction-to-six-sigma/where-do-those-six-sigma-statistics-come-from.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/introduction-to-six-sigma/where-do-those-six-sigma-statistics-come-from.html#comments</comments>
		<pubDate>Fri, 13 Jan 2012 18:16:03 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Education]]></category>
		<category><![CDATA[Introduction to Six Sigma]]></category>
		<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[action framework]]></category>
		<category><![CDATA[body of knowledge]]></category>
		<category><![CDATA[business courses]]></category>
		<category><![CDATA[compromises]]></category>
		<category><![CDATA[computer software]]></category>
		<category><![CDATA[email]]></category>
		<category><![CDATA[field of statistics]]></category>
		<category><![CDATA[improvements]]></category>
		<category><![CDATA[interaction]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[mathematics courses]]></category>
		<category><![CDATA[probability plot]]></category>
		<category><![CDATA[project management courses]]></category>
		<category><![CDATA[psychology courses]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[statistical methods]]></category>
		<category><![CDATA[statistics courses]]></category>
		<category><![CDATA[subset]]></category>
		<category><![CDATA[sum of squares]]></category>
		<category><![CDATA[tradeoff]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3706</guid>
		<description><![CDATA[In Lean Six Sigma we take information from a dozen or so statistics courses, project management courses, psychology courses, business courses, mathematics courses, etc. and put it into an action framework that can be used to make fast improvements.]]></description>
			<content:encoded><![CDATA[<p>A student of mine had numerous questions about the various statistics used in Six Sigma. Here is my response to him in an open email:</p>
<blockquote><p>The questions you are asking regarding “Where do these statistics come from?” require entire courses in statistics to answer. In Lean Six Sigma we take information from a dozen or so statistics courses, project management courses, psychology courses, business courses, mathematics courses, etc. and put it into an action framework that can be used to make fast improvements. We probably present less than 10% of the information you would receive if you sat through all of these courses, but we do so in less than 5% of the time it would take to complete all of these courses. It&#8217;s a tradeoff. We make the greatest compromises in the field of statistics. We discuss the use and interpretation of a select subset of statistics, and answer the question “where do these statistics come from?” by saying “they come from computer software.” While most are satisfied with this answer, some find the answer to be most unsatisfying. Judging from your questions, I suspect you are in the latter group.</p>
<div id="attachment_3709" class="wp-caption alignleft" style="width: 310px"><a href="http://www.itl.nist.gov/div898/handbook/prc/section4/prc427.htm"title="http://www.itl.nist.gov/div898/handbook/prc/section4/prc427.htm"  target="_blank"><img class="size-medium wp-image-3709  " title="anova-table-calculations-e-handbook-of-statistics" src="http://www.sixsigmatraining.org/2012/01/anova-table-calculations-e-handbook-of-statistics-300x118.png" alt="anova-table-calculations-e-handbook-of-statistics" width="300" height="118" /></a><p class="wp-caption-text">Two-Way ANOVA Calculations from E-Handbook of Statistics</p></div>
<p>Assuming you don’t have the time or the desire to take all of the courses relating to the Lean Six Sigma body of knowledge, but still seek answers to the specific statistics you asked about, I recommend the <a href="http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-e-handbook-of-statistical-methods.html?source=rss"title="E-handbook of statistical methods"  target="_blank">E-Handbook of Statistical Methods</a>. This reference source is free and very comprehensive. It’s easy to search and will give you the answers you seek. For example, I searched on the term sum of squares, which you asked about, and the search returned pages on the half-normal probability plot (your question about fig. 10.26,) 1-way ANOVA (several of your question were about these calculations,) and several other related topics. A search on ss interaction provides answers to your question about the calculation of this intermediate statistic.</p>
<p>Sorry I can’t address all of your questions via email, but perhaps the reference above will start you on your way to answers. I had the same questions when I started learning about quality improvement nearly 45 years ago, and I am still looking for answers to questions today. Have fun!</p></blockquote>
<p>Tom Pyzdek</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/introduction-to-six-sigma/where-do-those-six-sigma-statistics-come-from.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Statistical Surprises and Absurdities</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-surprises-and-absurdities.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-surprises-and-absurdities.html#comments</comments>
		<pubDate>Mon, 19 Dec 2011 00:05:52 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Six Sigma Videos]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[Webinars]]></category>
		<category><![CDATA[absurdities]]></category>
		<category><![CDATA[sampling bias]]></category>
		<category><![CDATA[selective data]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[surprises]]></category>
		<category><![CDATA[video recording]]></category>
		<category><![CDATA[webinar]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3603</guid>
		<description><![CDATA[Topics discussed included sampling bias, misused and misleading averages, distorting results by use of selective data weighting, selective reporting, missing information, distorted graphics, Say What? and So What? statistics, and much more!]]></description>
			<content:encoded><![CDATA[<p title="Webinar recording">I held a Webinar for Pyzdek Institute students entitled Statistical Surprises and Absurdities. Topics discussed included sampling bias, misused and misleading averages, distorting results by use of selective data weighting, selective reporting, missing information, distorted graphics, Say What? and So What? statistics, and much more! Here&#8217;s the recording</p>
<div id="media">
            <object id="csSWF" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="378" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=9,0,115,0"><param name="src" value="http://www.sixsigmatraining.org/videos/statistical-surprises-and-absurdities/statistical-surprises-and-absurdities_controller.swf"/><param name="bgcolor" value="#1a1a1a"/><param name="quality" value="best"/><param name="allowScriptAccess" value="always"/><param name="allowFullScreen" value="true"/><param name="scale" value="showall"/><param name="flashVars" value="autostart=false#&#038;thumb=FirstFrame.png&#038;thumbscale=45&#038;color=0x1A1A1A,0x1A1A1A"/><embed name="csSWF" src="http://www.sixsigmatraining.org/videos/statistical-surprises-and-absurdities/statistical-surprises-and-absurdities_controller.swf" width="640" height="378" bgcolor="#1a1a1a" quality="best" allowScriptAccess="always" allowFullScreen="true" scale="showall" flashVars="autostart=false&#038;thumb=FirstFrame.png&#038;thumbscale=45&#038;color=0x1A1A1A,0x1A1A1A" pluginspage="http://www.macromedia.com/shockwave/download/index.cgi?P1_Prod_Version=ShockwaveFlash"></embed></object>
        </div>
<p><a href="http://www.sixsigmatraining.org/downloads/statistical-surprises-and-absurdities.pdf?source=rss"title="Webinar slides"  target="_blank">Here&#8217;s a link</a> to the slides presented in the webinar.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-surprises-and-absurdities.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pyzdek Institute Offers Free Statistics Course with Belt Training</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/pyzdek-institute-offers-free-statistics-course-with-belt-training.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/pyzdek-institute-offers-free-statistics-course-with-belt-training.html#comments</comments>
		<pubDate>Mon, 28 Nov 2011 12:05:07 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[basic algebra]]></category>
		<category><![CDATA[black belt training]]></category>
		<category><![CDATA[college statistics]]></category>
		<category><![CDATA[introductory statistics course]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[michael starbird]]></category>
		<category><![CDATA[risk assessment]]></category>
		<category><![CDATA[six sigma green belt]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[statistical reasoning]]></category>
		<category><![CDATA[statistical tools]]></category>
		<category><![CDATA[statistics courses]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3512</guid>
		<description><![CDATA[The Pyzdek Institute has announced that it is giving away a complete Statistics course with registration for any of its Six Sigma or Lean Six Sigma Green Belt or Black Belt training courses. Who says there's no such thing as a free lunch!?!]]></description>
			<content:encoded><![CDATA[<p>The Pyzdek Institute has announced that it is giving away a complete Statistics course with registration for any of its Six Sigma or Lean Six Sigma Green Belt or Black Belt training courses. The statistics course, which includes 4 DVDs and two follow-along printed guides, consists of 24 lectures of 30 minutes each. Part 1 (12 lectures) covers all of the subjects commonly included with college introductory statistics course. Part 2 (12 lectures) explores a wide variety of applications of statistical methods.These challenging yet accessible lectures assume no background in mathematics beyond basic algebra. While most introductory college statistics courses stress technical problem solving and plugging data into formulae, this course focuses on the logical foundations and underlying strategies of statistical reasoning, illustrated with plenty of examples. Professor Michael Starbird walks you through the most important equations, but his emphasis is on the role of statistics in daily life, giving you a broad overview of how statistical tools are employed in risk assessment, college admissions, drug testing, fraud investigation, and a host of other applications.</p>
<p>This offer is good only while supplies last. <a href="http://www.sixsigmatraining.org/store?source=rss"title="Online training" >Click here</a> to register or to get additional details.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/pyzdek-institute-offers-free-statistics-course-with-belt-training.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Free Webinar about a New Control Chart</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-webinar-about-a-new-control-chart.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-webinar-about-a-new-control-chart.html#comments</comments>
		<pubDate>Thu, 22 Sep 2011 17:00:06 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[attributes]]></category>
		<category><![CDATA[binomial]]></category>
		<category><![CDATA[control charts]]></category>
		<category><![CDATA[free webinar]]></category>
		<category><![CDATA[minitab]]></category>
		<category><![CDATA[sigmaxl]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[statistical applications]]></category>
		<category><![CDATA[statistical methodology]]></category>
		<category><![CDATA[statistical process control]]></category>
		<category><![CDATA[tqm]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3317</guid>
		<description><![CDATA[David’s p-prime chart is an innovation that is being used in a wide variety of real-world applications. It is now included in many statistical applications, such as Minitab and SigmaXL.]]></description>
			<content:encoded><![CDATA[<p>The Laney p’ Control Chart is an exciting innovation in statistical process control (SPC). The classic control charts for attributes data (p-charts, u-charts, etc.) are based on assumptions about the underlying distribution of their data (binomial or Poisson). Inherent in those assumptions is the further assumption that the “parameter” (mean) of the distribution is constant over time. In real applications, this is not always true (some days it rains and some days it does not). This is especially noticeable when the subgroup sizes are very large. Until now, the solution has been to treat the observations as variables in an individual’s chart. Unfortunately, this produces flat control limits even if the subgroup sizes vary. David B. Laney developed an innovative approach to this situation which has come to be known as the Laney p’ chart (p-prime chart.) It is a universal technique that is applicable whether the parameter is stable or not.</p>
<h4>About Your Presenter, David B. Laney</h4>
<div class="wp-caption alignleft" style="width: 76px"><img style="margin: 5px; border: 0pt none;" title="David B. Laney" src="http://6-sigma-training.com/file.php/1/david_laney.jpg" alt="David B. Laney" width="66" height="100" align="left" border="0" hspace="5" vspace="5" /><p class="wp-caption-text">David B. Laney</p></div>
<p>David B. Laney worked for 33 years at BellSouth as Directory of Statistical Methodology. He is a pioneer at BellSouth in TQM, DOE, and Six Sigma. David’s p-prime chart is an innovation that is being used in a wide variety of areas. It is now included in many statistical applications, such as Minitab and SigmaXL. David is enjoying retirement with his family in the Birmingham, Alabama area.</p>
<h4>Date: Wednesday, September 28, 2011</h4>
<p>Session #1, 1:00 PM Eastern Time. <a href="https://www1.gotomeeting.com/register/640334577"title="Register for first session"  target="_blank">Click here</a> to register.<br />
Session #2, 7:00 PM Eastern Time. <a href="https://www1.gotomeeting.com/register/790382009"title="Register for second session"  target="_blank">Click here</a> to register.</p>
<p>&nbsp;</p>
<h3>Update</h3>
<p><a href="http://www.sixsigmatraining.org/videos/2011-09-28-laney-p-prime-chart-webinar.wmv?source=rss"title="Link to recording of webinar"  target="_blank">Click here</a> to view a video recording of David&#8217;s webinar.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-webinar-about-a-new-control-chart.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.sixsigmatraining.org/videos/2011-09-28-laney-p-prime-chart-webinar.wmv" length="300725542" type="video/asf" />
		</item>
		<item>
		<title>Free E-handbook of Statistical Methods</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-e-handbook-of-statistical-methods.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-e-handbook-of-statistical-methods.html#comments</comments>
		<pubDate>Thu, 08 Sep 2011 22:26:30 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Resource Providers]]></category>
		<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[case studies]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[nist]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[statistical methods]]></category>
		<category><![CDATA[statistical tool]]></category>
		<category><![CDATA[us department of commerce]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3295</guid>
		<description><![CDATA[Access the NIST/SEMATECH e-Handbook of Statistical Methods. NIST is an agency of the US Department of commerce, so this work was undertaken at public expense. It covers literally every statistical tool used in Lean Six Sigma, and many, many more. It includes hundreds of case studies and examples. Best of all, it's free! Enjoy! ]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.itl.nist.gov/div898/handbook/"title="E-handbook of statistical methods"  target="_blank">Click here</a> to access the <em>NIST/SEMATECH e-Handbook of Statistical Methods.</em> NIST is an agency of the US Department of commerce, so this work was undertaken at public expense. It covers literally every statistical tool used in Lean Six Sigma, and many, many more. It includes hundreds of case studies and examples. Best of all, it&#8217;s free! Enjoy!</p>
<div id="attachment_3298" class="wp-caption alignleft" style="width: 588px"><a href="http://www.sixsigmatraining.org/2011/09/nist-ehandbook-example.png?source=rss"title="Skewness and Kurtosis" rel="http://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm"  target="_blank"><img class="size-full wp-image-3298 " title="nist-ehandbook-example" src="http://www.sixsigmatraining.org/2011/09/nist-ehandbook-example.png" alt="nist-ehandbook-example" width="578" height="1006" /></a><p class="wp-caption-text">Skewness from NIST E-handbook</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/free-e-handbook-of-statistical-methods.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using QI Macros to Test Normality</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/using-qi-macros-to-test-normality.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/using-qi-macros-to-test-normality.html#comments</comments>
		<pubDate>Tue, 19 Apr 2011 12:00:22 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Six Sigma Videos]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3111</guid>
		<description><![CDATA[This video illustrates the use of QI macros software to test the assumption that a set of data follow the normal distribution.]]></description>
			<content:encoded><![CDATA[<div id="media">
            <object id="csSWF" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="378" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=9,0,115,0"><param name="src" value="http://www.pyzdek.com/videos/qimacros-normality-test-blog/qimacros-normality-test-blog_controller.swf"/><param name="bgcolor" value="#1a1a1a"/><param name="quality" value="best"/><param name="allowScriptAccess" value="always"/><param name="allowFullScreen" value="true"/><param name="scale" value="showall"/><param name="flashVars" value="autostart=false#&#038;thumb=FirstFrame.png&#038;thumbscale=45&#038;color=0x1A1A1A,0x1A1A1A"/><embed name="csSWF" src="http://www.pyzdek.com/videos/qimacros-normality-test-blog/qimacros-normality-test-blog_controller.swf" width="640" height="378" bgcolor="#1a1a1a" quality="best" allowScriptAccess="always" allowFullScreen="true" scale="showall" flashVars="autostart=false&#038;thumb=FirstFrame.png&#038;thumbscale=45&#038;color=0x1A1A1A,0x1A1A1A" pluginspage="http://www.macromedia.com/shockwave/download/index.cgi?P1_Prod_Version=ShockwaveFlash"></embed></object>
        </div>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/using-qi-macros-to-test-normality.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Statistical Engineering</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-engineering.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-engineering.html#comments</comments>
		<pubDate>Mon, 11 Apr 2011 12:00:05 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[aristotle]]></category>
		<category><![CDATA[corporate data warehouses]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[hal varian]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[roger w hoerl]]></category>
		<category><![CDATA[statistical engineering]]></category>
		<category><![CDATA[statistical methods]]></category>
		<category><![CDATA[statistical software]]></category>
		<category><![CDATA[statisticians]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3057</guid>
		<description><![CDATA[Statistical Engineering implies the application of statistics in a systematic framework that utilizes technology to create or improve products, processes and services that improve the lives of people.]]></description>
			<content:encoded><![CDATA[<p>In the movie &#8220;The Graduate,&#8221; the new graduate is told by a would-be mentor to remember only one word as he heads out into the world: Plastics. Times have changed. Hal Varian, the chief economist at Google says, ‘‘I keep saying that the sexy job in the next 10 years will be statisticians. And I’m not kidding.’’ Statistical methods are being used by a larger cross-section of people in a wider variety of industries than ever before. There are numerous reasons for this. Nearly everyone has what was once considered to be a supercomputer sitting on their desktop. Powerful statistical software is widely available, including popular packages like <a href="http://www.minitab.com" target="_blank">Minitab</a>, <a href="http://www.jmp.com">JMP</a>, <a href="http://www.sas.com" target="_blank">SAS </a>and <a href="http://www.spss.com" target="_blank">SPSS</a>, and extremely powerful <a href="http://www.r-project.org"title="R-Project"  target="_blank">free software</a>. Oracle&#8217;s <a href="http://www.oracle.com/us/products/applications/crystalball/crystalball-066563.html"title="Crystal Ball software"  target="_blank">Crystal Ball software</a> makes it possible to create a statistical distribution for any cell in a spreadsheet, making statistical simulation a snap. While becoming more sophisticated, the software is also becoming easier to use. Output is increasingly graphical and easier to explain to laypersons. The number of people trained in Lean Six Sigma methods is growing rapidly. There is an enormous amount of data saved in public and corporate data warehouses. The list goes on and on.<br />

<div>But perhaps the most important reason for the ballooning use of statistics is: it works.</div>
<p>
<div>If we take Aristotle&#8217;s logic as the historical starting point for rational analysis, and Galileo&#8217;s experimental method as the next major leap, then statistical methods might be viewed as the next step in applied analysis. Many problems don&#8217;t lend themselves to solution by pure logic nor by carefully planned and controlled experimentation. Most organizations, especially in the commercial sector, must deal with so many problems and such a dynamic external environment that they are forced to make quick decisions despite large uncertainty, then move on to the next problem. Statistical methods help these decision makers evaluate the evidence and make better decisions quickly. The tools and technology described in the first paragraph make this easier than ever before.</div>
<p>
<div>This situation is much more akin to engineering than it is to pure science. The approach has been termed &#8220;Statistical Engineering.&#8221; Authors Roger W. Hoerl and Ron Snee describe Statistical Engineering as follows:</div>
<p>
<div>
<div style="padding-left: 30px;"><em>&#8220;The statistical engineering discipline [is] the study of how to utilize the principles and techniques of statistical science for beneﬁt of humankind. From an operational perspective we deﬁne statistical engineering as the study of </em><em>how to best utilize statistical concepts, methods, and tools and integrate them with information technology and other relevant sciences to generate improved results. In other words, engineers—statistical or otherwise—do not focus on advancement of the fundamental laws of science but rather how they might be best utilized for practical beneﬁt.</em></div>
<p>
<div>This definition goes beyond applied statistics. Statistical Engineering implies the application of statistics in a systematic framework that utilizes technology to create or improve products, processes and services that improve the lives of people. Disciplines such as Lean Six Sigma, Quality Engineering, Reliability Engineering, and others can be said to do this to some degree, but there are other ways to use Statistical Engineering, some quite unexpected. Billy Beane, general manager of MLB&#8217;s Oakland A&#8217;s and protagonist of Michael Lewis&#8217;s book <em>Moneyball</em>, had a problem: how to win in the Major Leagues with a budget that&#8217;s smaller than that of nearly every other team. Conventional wisdom long held that big name, highly athletic hitters and young pitchers with rocket arms were the ticket to success. But Beane and his staff, buoyed by massive amounts of carefully interpreted statistical data, believed that wins could be had by more affordable methods such as hitters with high on-base percentage and pitchers who get lots of ground outs. Given this information and a tight budget, Beane defied tradition and his own scouting department to build winning teams of young affordable players and inexpensive castoff veterans. Author Michael Lewis examines how in 2002 the Oakland Athletics achieved a spectacular winning record while having the smallest player payroll of any major league baseball team. Given the heavily publicized salaries of players for teams like the Boston Red Sox or New York Yankees, baseball insiders and fans assume that the biggest talents deserve and get the biggest salaries. However, argues author Michael Lewis, little-known numbers and statistics matter more.</div>
<p>
<div>Statistical Engineering is not limited to applied statistics, theoretical statistics have a place too. In a paper published in the April-June 2011 issue of the journal <em>Quality Engineering</em> author Philip R. Scinto offers this list of Statistical Engineering attributes:</div>
<p>
<div>
<ul>
<li>Meets high-level needs of an organization</li>
<li>Work/study for the greater good</li>
<li>Use of statistical concepts and tools</li>
<li>Collaborative effort with other sciences</li>
<li>Integrated with other sciences</li>
<li>Documented protocol</li>
<li>Activity continuous with sustainable life</li>
<li>Improved results</li>
</ul>
<p>It isn&#8217;t necessary that all items on the list be checked off, but the list is useful in evaluating whether an activity qualifies as Statistical Engineering or if it&#8217;s merely another clever use of statistics. The important thing isn&#8217;t the label we apply, but the improvement that can be achieved by properly using statistical methods along with science and technology to achieve a challenging goal.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/statistical-engineering.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Problem with Swiss Army Knife Control Charting</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/the-problem-with-swiss-army-knife-control-charting.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/the-problem-with-swiss-army-knife-control-charting.html#comments</comments>
		<pubDate>Thu, 31 Mar 2011 20:12:01 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[3 standard deviations]]></category>
		<category><![CDATA[control charts]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[real world]]></category>
		<category><![CDATA[six sigma green belt]]></category>
		<category><![CDATA[statistical process control]]></category>
		<category><![CDATA[statistical process control spc]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/the-problem-with-swiss-army-knife-control-charting.html</guid>
		<description><![CDATA[The skewed distribution problem is exacerbated by using I-charts. ]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m an advocate of using the I-chart as the default control chart. If I am teaching statistical process control (SPC) and can only teach one chart, the I-chart is always the one that I teach. It&#8217;s the only control chart I cover in my Lean Six Sigma Green Belt training. It&#8217;s the only chart that I teach in Process Excellence Leadership training. It&#8217;s the only chart I use if the data I&#8217;m looking at are reasonably close to symmetric (note that I didn&#8217;t say &#8220;normal&#8221;,) unless I have some compelling need for greater sensitivity. I teach that the I-chart is the &#8220;Swiss army knife&#8221; of control charts.</p>
<p>But I still sometimes use other control charts.</p>
<h4>The Problem</h4>
<p>Organizations don&#8217;t do SPC for the fun of it. They do it because it helps them achieve their goals. Organizations exist to produce things of value for the benefit of customers, investors, and employees. They do this by transforming inputs into outputs of higher value via processes. They can do this better if they minimize variability of outcomes, which can best be accomplished by controlling the sources of variation in the inputs and processes. This is where SPC comes in. SPC is a methodology that uses statistical guidelines to help separate &#8220;special cause&#8221; and &#8220;common cause&#8221; variation. If a special cause of variation exists, it signals the need to act. Special cause variation is defined as a change of such a large magnitude that its cause can probably be identified if looked for at once. SPC operationally defines such a change as a measurement result more than 3 standard deviations from the process mean for whatever process metric is being monitored.</p>
<p>A problem might exist if the process generates measurements that are highly skewed,  even when it is <em>not</em> being influenced by special causes of variations. Such processes are quite common in the real world. For example, nearly all measurements produced by geometric dimensioning and tolerancing are skewed, as are measurements of time-based phenomena such as those encountered in services industries including the healthcare and hospitality industries.  Highly skewed distributions produce a relatively high percentage of results more than 3 standard deviations from the mean even if no special causes exist. In other words, they produce many &#8220;false alarms&#8221; that will trigger a search for a problem when there is no problem. The false alarms may even lead to tampering, thereby causing a stable process to become unstable.</p>
<h4>I-Charts Don&#8217;t Solve the Problem</h4>
<p>The skewed distribution problem is exacerbated by using I-charts. I-charts are relatively insensitive to moderate departures from normality, and  very insensitive if the non-normality still produces a symmetric distribution. But for the data described above, this is not the case. If you use the I-chart for these data you will experience many false alarms. It&#8217;s just that simple.</p>
<p>The problem is to determine if a process is or is not being influenced by special causes of variation. A process distribution might appear as skewed because of special cause outliers, or because it naturally produces skewed data. The I-chart treats all data beyond 3 sigma as outliers; it doesn&#8217;t help you separate the natural, common cause process outcomes from special cause outcomes. Is the point beyond 3 sigma an outlying chicken, or a common cause egg? I.e., is the process being influenced by special causes, or only common causes? If the process data are naturally skewed you can&#8217;t answer this question using an I-chart.</p>
<h4>A Simple Solution</h4>
<p>The solution that I recommend is to begin your investigation with averages charts, also known as x-bar charts. Averages tend to have distributions that are approximately normal, even if the individual values are skewed. This means that, for a process with a skewed distribution that is not influenced by special causes, averages are much more likely to produce results that stay within 3 standard deviations of the mean than I-charts. It&#8217;s the best of both worlds: few false alarms, but still sensitive to special causes. If you have a nice run of subgroup averages without a special cause, plot a histogram of the data and see if the distribution looks skewed or symmetric. If the latter, you can use I-charts with confidence. If the former, stick with averages charts, or find a statistician or Master Black Belt to help you find a more advanced solution.</p>
<h4>Stable Does Not Mean Normal</h4>
<p>Before ending this article, I&#8217;d like to address another pet peeve of mine. I believe that too many teachers of SPC obsess on the need for normality. They confuse normality with the absence of special causes, also known as statistical control. I usually attribute this misunderstanding to a lack of experience with the real world, where normal distributions are so rare as to be virtually non-existent. By insisting on normality we encourage tampering and all of the problems associated with this approach to &#8220;process management.&#8221;</p>
<p>On the other hand, I am also impatient with people who insist that all non-normality be ignored. These individuals advocate using I-charts in all situations, regardless of the risk of false alarms. This attitude may also be due to a lack of experience. However, I&#8217;ve seen SPC lose its credibility when concerned process owners look for special causes over-and-over again without finding them. Like the boy who cried &#8220;Wolf!&#8221;, out-of-control signals become something to ignore. Eventually so does SPC.</p>
<p>My approach, which favors the I-chart but doesn&#8217;t make its use dogma, provides a rational middle ground.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/the-problem-with-swiss-army-knife-control-charting.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>When to Use Your Eyeballs, and When Not To</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/when-to-use-your-eyeballs-and-when-not-to.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/when-to-use-your-eyeballs-and-when-not-to.html#comments</comments>
		<pubDate>Fri, 25 Mar 2011 00:58:12 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Six Sigma Tools]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[conclusion]]></category>
		<category><![CDATA[eyeball]]></category>
		<category><![CDATA[histogram]]></category>
		<category><![CDATA[histograms]]></category>
		<category><![CDATA[normal curve]]></category>
		<category><![CDATA[normality test]]></category>
		<category><![CDATA[random numbers]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[statistical properties]]></category>
		<category><![CDATA[statistical software]]></category>
		<category><![CDATA[variation]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=3008</guid>
		<description><![CDATA[The point is that in the business world we often need to make decisions and then get on to other, more urgent matters. The normal distribution is a handy device for getting quick estimates that are useful for such decisions. ]]></description>
			<content:encoded><![CDATA[<div id="attachment_3007" class="wp-caption alignleft" style="width: 211px"><a href="http://www.sixsigmatraining.org/2011/03/two-normal-curves.png?source=rss"><img class="size-medium wp-image-3007 " title="two-normal-curves" src="http://www.sixsigmatraining.org/2011/03/two-normal-curves-201x300.png" alt="" width="201" height="300" /></a><p class="wp-caption-text">Fig. 1-Large and Small Samples of Normally Distributed Data</p></div>
<p>One of the exercises I assign to students in my training involves creating two histograms from normally distributed random numbers. The results often look similar to those shown in figure 1. When I ask students to comment on their histograms I usually get comments about the averages, spread, and other statistical properties. However, that misses the point I’m trying to teach.</p>
<p>When we do Six Sigma we usually spend a lot of time mining historical data from databases. Sometimes the sample sizes are large, and sometimes they can be quite small. In fact, even large sample sizes can become small when we slice-and-dice them drilling down with various categories and sub-categories in search of CTQs. Statistical software will often automatically fit a normal curve to histograms created from these data. It’s often tempting to use the fitted curves to make an eyeball judgment about the normality of the data. Sometimes this is a good idea, and sometimes it isn’t. If the sample sizes are small, then the curve may not appear to fit the data very well simply because of small sample variation. Witness the top histogram in figure 1 for an example of a curve fitted to a histogram from a sample size of <em>n</em>=20. The histogram looks like a poor fit, but the p-value of a normality test tells us that the fit is pretty good anyway. So we’re probably safe assuming normality and acting accordingly.</p>
<p>The lower curve is fitted to a sample of <em>n</em>=500 data values. It appears to be a much better fit, and the p-value will back this conclusion. But what if the eyeballed curve fit and the p-value disagree?</p>
<div id="attachment_3006" class="wp-caption alignright" style="width: 358px"><a href="http://www.sixsigmatraining.org/2011/03/histogram-and-probability-plot.jpg?source=rss"><img class="size-medium wp-image-3006 " title="histogram-and-probability-plot" src="http://www.sixsigmatraining.org/2011/03/histogram-and-probability-plot-300x200.jpg" alt="histogram-and-probability-plot" width="348" height="232" /></a><p class="wp-caption-text">Fig. 2-Decent Fit but Lousy P-value</p></div>
<p>Sometimes the fit of the curve is “close enough,” but the p-value will tell you that the fit is awful. Take a look at figure 2. The histogram suggests that the normal curve fits the data pretty well. There are many practical situations where you could use the normal distribution to make estimates and your estimates would be just fine. These are data on the time it takes to complete technical support calls. If you assume normality and you estimate costs or make a decision about process acceptability, your decisions will be essentially correct. However, the probability plot and AD goodness-of-fit statistic clearly show that the data are <em>not</em> normal and that the lack of fit is particularly poor in the tails (p &lt; 0.005.) A closer examination shows that even in the tail areas the discrepancies are fractions of a percent. For example, the normal distribution estimates that 99.9% of all calls will take less than 35 minutes to complete, while the data show about 99.5%. Chances are these differences are of little or no <em>practical</em> importance.</p>
<p>The point is that in the business world we often need to make decisions and then get on to other, more urgent matters. The normal distribution is a handy device for getting quick estimates that are useful for such decisions. If your sample size is relatively large (say 200 or more) then you can go with the normality assumption if the fitted curve looks reasonably good. On the other hand, if you only have a small amount of data, you can still use the normality assumption if the histogram fit looks lousy, providing the p-value of the goodness-of-fit statistic says the normal curve is okay, i.e., if p &gt; 0.05. The normality assumption is so useful that it&#8217;s worth using as a default, even if you bend the rules a bit.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/when-to-use-your-eyeballs-and-when-not-to.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Sampling Question</title>
		<link>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/a-sampling-question.html?source=rss</link>
		<comments>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/a-sampling-question.html#comments</comments>
		<pubDate>Mon, 14 Mar 2011 14:00:23 +0000</pubDate>
		<dc:creator>Thomas Pyzdek</dc:creator>
				<category><![CDATA[Inspection]]></category>
		<category><![CDATA[Statistical Tools for Six Sigma]]></category>
		<category><![CDATA[automation industry]]></category>
		<category><![CDATA[lean-six-sigma]]></category>
		<category><![CDATA[process variability]]></category>
		<category><![CDATA[sampling frequency]]></category>
		<category><![CDATA[six sigma green belt]]></category>
		<category><![CDATA[six-sigma]]></category>
		<category><![CDATA[spc control]]></category>

		<guid isPermaLink="false">http://www.sixsigmatraining.org/?p=2973</guid>
		<description><![CDATA[There is no rule that you need to sample n consecutive samples every x number of cycles. You are probably thinking of a technique known as PRE-Control, which is different than SPC.]]></description>
			<content:encoded><![CDATA[<p>A Six Sigma Green Belt student asked an interesting question about sampling. Here&#8217;s the question and my response.</p>
<p>QUESTION:<br />
======================<br />
Just a question that I thought I would run by you&#8230;<br />
I work in the Automation industry, and am currently working on two production lines, and logging data for the parts being produced. One line is producing 60 parts per minute and I can thus log the data for every part. The other line is producing 240 parts per minute, and it is not possible to log the data for every part. I remember reading somewhere that in order to perform SPC you must take n consecutive samples (I think n was 5) every x number of cycles. What I need is definitive guidance on how to calculate n and x. I also need to know the statistical reason that n and x are used in order to explain this to the customer. Any feedback you can give me in relation to this would be greatly appreciated.</p>
<p>Regards,</p>
<p>AT in Irelend</p>
<p>RESPONSE:<br />
======================<br />
There is no rule that you need to sample n consecutive samples every x number of cycles. You are probably thinking of a technique known as PRE-Control, which is different than SPC. PRE-control also incorporates rules for deciding when to increase or decrease sampling frequency, stopping rules for processes, etc.. Personally, I don&#8217;t like PRE-Control for a variety of reasons, but if you have <em>The Six Sigma Handbook</em>, 2nd edition I discuss in starting on p. 661 or the 3rd edition starting on p. 465. My primary reason for disliking PRE-control is that it is a specification-based scheme (which I dislike in principle) and it will allow process variation to increase until it is as wide as the specs allow. SPC is all about reducing process variability to a minimum by identifying special causes of variation. When used in conjunction with Lean Six Sigma, SPC will also address common cause variation.</p>
<p>Instead of PRE-control I suggest that you consider using standard SPC control charts. I don&#8217;t know anything about your process so I can only offer general advice. If you&#8217;re already logging in metrics for 60 parts-per-minute I would be surprised if you&#8217;re not encountering problems like autocorrelation, which requires an adjustment to standard SPC such as using EWMA charts instead of classical control  charts. If you have autocorrelation and are not using the proper chart, then you will be experiencing a lot of &#8220;false alarms.&#8221; Processes seldom change by any meaningful amount in a matter of seconds, so you can probably extend the sampling interval. If you feel that you can economically sample 60 per minute, and that it is wise to do so, then you could sample this number of parts from the process running 240 parts per minute rather than checking every part. It would be best to choose the sample at random, rather than sampling every 4th part. Samples chosen using a fixed pattern are susceptible to problems if the process exhibits similar patterns. For example, if the process had 4 positions on a workstation then your 1-in-every-4 sample would always be sampling from the same workstation. Sometimes the patterns in the process are quite difficult to spot, and &#8220;Murphy&#8217;s Law&#8221; can strike at any time. Murphy&#8217;s Law states that anything that can go wrong, will go wrong.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sixsigmatraining.org/statistical-tools-for-six-sigma/a-sampling-question.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

