<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BioStatMatt</title>
	<atom:link href="http://biostatmatt.com/feed" rel="self" type="application/rss+xml" />
	<link>http://biostatmatt.com</link>
	<description></description>
	<lastBuildDate>Wed, 16 May 2012 19:14:44 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Update: Parameters as Population Quantities</title>
		<link>http://biostatmatt.com/archives/2166</link>
		<comments>http://biostatmatt.com/archives/2166#comments</comments>
		<pubDate>Wed, 16 May 2012 19:14:44 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[bootstrap]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2166</guid>
		<description><![CDATA[Some time ago, I had an ineloquent and less-than-cordial online discussion with a commenter on this site, partially about how statisticians define the term "parameter". This post is just to quote a relevant passage from "Bootstrap Methods and Their Application", by Davison and Hinkley (1997), that better articulates a point I had made earlier. 2.1.1 [...]]]></description>
			<content:encoded><![CDATA[<p>Some time ago, I had an ineloquent and less-than-cordial <a href="http://biostatmatt.com/archives/1773">online discussion</a> with a commenter on this site, partially about how statisticians define the term "parameter". This post is just to quote a relevant passage from "Bootstrap Methods and Their Application", by Davison and Hinkley (1997), that better articulates a point I had made earlier.</p>
<blockquote><p>
2.1.1 Statistical Functions<br />
Many simple statistics can be thought of in terms of properties of the EDF [empirical distribution function]. For example the sample average $\bar{y} = n^{-1} \sigma y_j$ is the mean of the EDF. More generally, the statistic of interest $t$ will be a symmetric function of $y_1,\ldots,y_n$, meaning that $t$ is unaffected by reordering the data. This implies that $t$ depends on the ordered values $y_{(1)} \leq \cdots \leq y_{(n)}$, or equivalently on the EDF $\hat{F}$. Often this can be expressed simply as $t = t(\hat{F})$, where $t(\cdot)$ is a <em>statistical function</em> - essentially just a mathematical expression of the algorithm for computing $t$ from $\hat{F}$. Such a statistical function is of central importance in the nonparametric case because it also defines the parameter of interest $\theta$ through the "algorithm" $\theta = t(F)$. This corresponds to the qualitative idea that $\theta$ is a characteristic of the population described by $F$...
</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2166/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>useR! 2012 Deadlines Approaching: Registration, Hotels,  Student Scholarships</title>
		<link>http://biostatmatt.com/archives/2150</link>
		<comments>http://biostatmatt.com/archives/2150#comments</comments>
		<pubDate>Thu, 05 Apr 2012 17:28:24 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[useR]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2150</guid>
		<description><![CDATA[Forwarded from Frank Harrell: DEADLINES FAST APPROACHING – 8th Annual International R User Conference useR! 2012, Nashville, Tennessee USA Registration Deadlines: Early Registration: Passed Regular Registration: Mar 1- May 12 Late Registration: May 13 – June 4 On-Site Registration: June 12 – June 15 Please note: Nashville is offering several large entertainment events the month [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://biostat.mc.vanderbilt.edu/wiki/Main/useR-2012"><img src="http://biostatmatt.com/uploads/useRLogoMedium1.png" alt="" title="useRLogoMedium" width="525" height="239" class="aligncenter size-full wp-image-2110" /></a></p>
<p>Forwarded from Frank Harrell:</p>
<p><b>DEADLINES FAST APPROACHING</b> – 8th Annual International R User Conference useR! 2012, Nashville, Tennessee USA</p>
<p>Registration Deadlines:<br />
Early Registration: Passed<br />
Regular Registration: Mar 1- May 12<br />
Late Registration: May 13 – June 4<br />
On-Site Registration: June 12 – June 15</p>
<p>Please note: Nashville is offering several large entertainment events the month of June, and hotels are quickly selling out.  It's imperative that you make your hotel accommodations for the conference as soon as possible.  For those of you who have submitted abstracts we will be<br />
notifying you this week regarding acceptance as an oral presentation or as a poster.</p>
<p>Students: A limited number of $500 reimbursements for registration and travel expenses are available, based on merit and need.  Please apply by sending an application to Tatsuki Koyama at tatsuki.koyama@Vanderbilt.Edu by April 15.  Include a brief CV, a copy of your abstract if one was submitted, a statement that demonstrates your need for financial assistance, and a letter of support from your supervisor.</p>
<p>Please join us at the 8th Annual International R User Conference useR! 2012 in Nashville, Tennessee.  For more conference details, please visit <a href="http://biostat.mc.vanderbilt.edu/wiki/Main/useR-2012"><tt>http://biostat.mc.vanderbilt.edu/wiki/Main/useR-2012</tt></a></p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2150/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Resampling Hierarchically Structured Data Recursively</title>
		<link>http://biostatmatt.com/archives/2125</link>
		<comments>http://biostatmatt.com/archives/2125#comments</comments>
		<pubDate>Wed, 04 Apr 2012 15:47:39 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[bootstrap]]></category>
		<category><![CDATA[graphics]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2125</guid>
		<description><![CDATA[That's a mouthful! I presented this topic to a group of Vandy statisticians a few days ago. My notes (essentially reproduced in this post) are recorded at the Dept. of Biostatistics wiki: HowToBootstrapCorrelatedData. The presentation covers some bootstrap strategies for hierarchically structured (correlated) data, but focuses on the multi-stage bootstrap; an extension of that described [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://biostatmatt.com/uploads/graph34679d7251d157935c4523e3c90235841.png"><img src="http://biostatmatt.com/uploads/graph34679d7251d157935c4523e3c90235841.png" alt="" title="graph34679d7251d157935c4523e3c9023584" width="695" height="251" class="aligncenter size-full wp-image-2146" /></a></p>
<p>That's a mouthful! I presented this topic to a group of Vandy statisticians a few days ago. My notes (essentially reproduced in this post) are recorded at the Dept. of Biostatistics wiki: <a href="http://biostat.mc.vanderbilt.edu/wiki/Main/HowToBootstrapCorrelatedData">HowToBootstrapCorrelatedData</a>. The presentation covers some bootstrap strategies for hierarchically structured (correlated) data, but focuses on the <em>multi-stage</em> bootstrap; an extension of that described by Davison and Hinkley (ISBN 978-0-521-57471-6).</p>
<p>The multi-stage bootstrap mimics the data generating mechanism by resampling in a nested fashion. For example, resample first among factors at the highest level of hierarchy. Then, for each resampled factor, further resample among factors at the next lower level, and so forth. Each level may be resampled with or without replacement. Furthermore, some levels of hierarchy may be ignored completely, if considered to have little or no effect on the data correlation structure. Whether to ignore a level of hierarchy, or to sample with replacement are important bootstrap design considerations.</p>
<p>The <tt>resample</tt> function below implements a multi-stage bootstrap recursively. That is, levels of hierarchy are traversed by nested calls to <tt>resample</tt>. The <tt>dat</tt> argument is a dataframe with factor fields for each level of hierarchy (<em>e.g.</em>, hospital, patient, measurement), and a numeric field of measured values. The <tt>cluster</tt> argument is a character vector that identifies the hierarchy in order from top to bottom (<em>e.g.</em>, <tt>c('hospital','patient','measurement')</tt>). The <tt>replace</tt> argument is a logical vector that indicates whether sampling should be with replacement at the corresponding level of hierarchy (<em>e.g.</em>, <tt>c(TRUE,FALSE,FALSE)</tt>).</p>
<pre>
resample &lt;- function(dat, cluster, replace) {

  # exit early for trivial data
  if(nrow(dat) == 1 || all(replace==FALSE))
      return(dat)

  # sample the clustering factor
  cls &lt;- sample(unique(dat[[cluster[1]]]), replace=replace[1])

  # subset on the sampled clustering factors
  sub &lt;- lapply(cls, function(b) subset(dat, dat[[cluster[1]]]==b))

  # sample lower levels of hierarchy (if any)
  if(length(cluster) > 1)
    sub &lt;- lapply(sub, resample, cluster=cluster[-1], replace=replace[-1])

  # join and return samples
  do.call(rbind, sub)

}
</pre>
<p>The following block of <tt>R</tt> code simulates a dataset with 5 correlated (rho = 0.4) repeat measurements on each of 10 patients, from each of 5 hospitals. Hence, there are 250 simulated measurements and 50 patients in total. Patients are simulated independently (<em>i.e.</em>, the hospital level of hierarchy has no affect on the correlation structure). The functions <tt>covimage</tt> and <tt>datimage</tt> generate a levelplot representations of the covariance and data matrices for the simulated data, respectively.</p>
<pre>
# simulate correlated data
rho &lt;- 0.4
dat &lt;- expand.grid(
  measurement=factor(1:5),
  patient=factor(1:10),
  hospital=factor(1:5))
sig &lt;- rho * tcrossprod(model.matrix(~ 0 + patient:hospital, dat))
diag(sig) &lt;- 1
dat$value &lt;- chol(sig) %*% rnorm(250, 0, 1)

library("lattice")

covimage &lt;- function(x)
   levelplot(as.matrix(x), aspect="fill", scales=list(draw=FALSE),
      xlab="", ylab="", colorkey=FALSE, col.regions=rev(gray.colors(100, end=1.0)),
      par.settings=list(axis.line=list(col=NA,lty=1,lwd=1)))

datimage &lt;- function(x) {
   mat &lt;- as.data.frame(lapply(x, as.numeric))
   levelplot(t(as.matrix(mat)), aspect="fill", scales=list(cex=1.2, y=list(draw=FALSE)),
      ylab="", xlab="", colorkey=FALSE, col.regions=gray.colors(100),
      par.settings=list(axis.line=list(col=NA,lty=1,lwd=1)))
}

datimage(dat)
covimage(sig)
</pre>
<p>The images below result from calls to <tt>datimage(dat)</tt> and <tt>covimage(dat)</tt> respectively.</p>
<p><a href="http://biostatmatt.com/uploads/datimage.png"><img src="http://biostatmatt.com/uploads/datimage.png" alt="" title="datimage" width="600" height="480" class="aligncenter size-full wp-image-2136" /></a></p>
<p><a href="http://biostatmatt.com/uploads/covimage.png"><img src="http://biostatmatt.com/uploads/covimage.png" alt="" title="covimage" width="600" height="480" class="aligncenter size-full wp-image-2135" /></a></p>
<p>The next block of <tt>R</tt> code generates several boostrap distributions for the sample mean, and approximates the 'true' sampling distribution by Monte Carlo. The final series of boxplots (shown below) illustrate that bootstrap design greatly impacts the inferred distribution of the sample mean (and presumably for other sample statistics). Hence, it's important to think carefully about bootstrap design for hierarchically structured data, and ensure that it closely reflects the 'true' data generating mechanism.</p>
<pre>
# bootstrap ignoring hospital and patient levels
cluster &lt;- c("measurement")
system.time(mF &lt;- replicate(200, mean(resample(dat, cluster, c(F))$val)))
system.time(mT &lt;- replicate(200, mean(resample(dat, cluster, c(T))$val)))
#boxplot(list("F" = mF, "T" = mT))

# bootstrap ignoring hospital level
cluster &lt;- c("patient","measurement")
system.time(mFF &lt;- replicate(200, mean(resample(dat, cluster, c(F,F))$val)))
system.time(mTF &lt;- replicate(200, mean(resample(dat, cluster, c(T,F))$val)))
system.time(mTT &lt;- replicate(200, mean(resample(dat, cluster, c(T,T))$val)))
#boxplot(list("FF" = mFF, "TF" = mTF, "TT" = mTT))

# bootstrap accounting for full hierarchy
cluster &lt;- c("hospital","patient","measurement")
system.time(mFFF &lt;- replicate(200, mean(resample(dat, cluster, c(F,F,F))$val)))
system.time(mTFF &lt;- replicate(200, mean(resample(dat, cluster, c(T,F,F))$val)))
system.time(mTTF &lt;- replicate(200, mean(resample(dat, cluster, c(T,T,F))$val)))
system.time(mTTT &lt;- replicate(200, mean(resample(dat, cluster, c(T,T,T))$val)))
#boxplot(list("FFF" = mFFF, "TFF" = mTFF, "TTF" = mTTF, "TTT" = mTTT))

# Monte Carlo for the true sampling distribution
system.time(mMC &lt;- replicate(200, mean(chol(sig) %*% rnorm(250, 0, 1))))
#boxplot(list("MC" = mMC))

boxplot(list("MC" = mMC,
             "F" = mF, "T" = mT,
             "FF" = mFF, "TF" = mTF, "TT" = mTT,
             "FFF" = mFFF, "TFF" = mTFF, "TTF" = mTTF, "TTT" = mTTT))
</pre>
<p>The following figure presents boxplots for the distribution of sample means under the above sequence of bootstrap strategies. The "MC" boxplot summarizes the 'true' distribution of the sample mean (estimated using Monte Carlo). The remaining boxplots are labeled according to the bootstrap strategy used. For instance, the "TF" boxplot corresponds to a multi-stage bootstrap of patients with replacement and measurements-within-patients without replacement (this is commonly called the "cluster bootstrap"), but that ignores the hospital factor. This strategy most closely reflects the data generating mechanism. Notice that sampling all levels of hierarchy without replacement (<em>e.g.</em>, "FFF") simply permutes the indices of the resampled data, and does not confer any variability on the sample mean.<br />
<a href="http://biostatmatt.com/uploads/multi-stage-bootstrap.png"><img src="http://biostatmatt.com/uploads/multi-stage-bootstrap.png" alt="" title="multi-stage-bootstrap" width="600" height="480" class="aligncenter size-full wp-image-2138" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2125/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Luke is almost 1 year old</title>
		<link>http://biostatmatt.com/archives/2115</link>
		<comments>http://biostatmatt.com/archives/2115#comments</comments>
		<pubDate>Fri, 30 Mar 2012 01:51:57 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[birthday]]></category>
		<category><![CDATA[family]]></category>
		<category><![CDATA[friends]]></category>
		<category><![CDATA[Luke]]></category>
		<category><![CDATA[Mary]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2115</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p><a href="http://biostatmatt.com/uploads/IMG_0967_squashed.jpg"><img src="http://biostatmatt.com/uploads/IMG_0967_squashed.jpg" alt="" title="IMG_0967_squashed" width="600" height="450" class="aligncenter size-full wp-image-2116" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2115/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>useR! 2012 Abstract Submission Deadline Today!</title>
		<link>http://biostatmatt.com/archives/2108</link>
		<comments>http://biostatmatt.com/archives/2108#comments</comments>
		<pubDate>Mon, 12 Mar 2012 13:21:46 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2108</guid>
		<description><![CDATA[useR! 2012 is just around the corner. The deadline for talk and poster abstract submissions is today! Submit your abstract here.]]></description>
			<content:encoded><![CDATA[<p><a href="http://biostat.mc.vanderbilt.edu/wiki/Main/useR-2012"><img src="http://biostatmatt.com/uploads/useRLogoMedium1.png" alt="" title="useRLogoMedium" width="525" height="239" class="aligncenter size-full wp-image-2110" /></a><br />
<a href="http://biostat.mc.vanderbilt.edu/wiki/Main/useR-2012"><em>useR!</em> 2012</a> is just around the corner. The deadline for talk and poster abstract submissions is today! Submit your abstract <a href="http://biostat.mc.vanderbilt.edu/wiki/Main/UseR-2012#Online_Submission">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2108/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>useR! 2012 Early Registration Ending Tomorrow!</title>
		<link>http://biostatmatt.com/archives/2097</link>
		<comments>http://biostatmatt.com/archives/2097#comments</comments>
		<pubDate>Tue, 28 Feb 2012 19:53:15 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[useR]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2097</guid>
		<description><![CDATA[The early registration deadline for useR! 2012 is tomorrow! Visit the Online Registration Website. The fees for registration increase March 1st.]]></description>
			<content:encoded><![CDATA[<p><a href="http://biostat.mc.vanderbilt.edu/wiki/Main/UseR-2012"><img src="http://biostatmatt.com/uploads/useRLogoMedium.png" alt="" title="useRLogoMedium" width="525" height="239" class="aligncenter size-full wp-image-2101" /></a></p>
<p>The early registration deadline for <em>useR!</em> 2012 is tomorrow! Visit the <a href="http://biostat.mc.vanderbilt.edu/wiki/Main/UseR-2012#Registration">Online Registration Website</a>. The fees for registration increase March 1<sup>st</sup>.</p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2097/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Invest or Pay Extra on Mortgage?</title>
		<link>http://biostatmatt.com/archives/2051</link>
		<comments>http://biostatmatt.com/archives/2051#comments</comments>
		<pubDate>Sun, 26 Feb 2012 02:58:17 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[finance]]></category>
		<category><![CDATA[mortgage]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2051</guid>
		<description><![CDATA[In writing this post, I discovered that this question is very common1,2,3,4, and that my treatment will be relatively simplistic (I almost didn't post it, but this blog has been dark for a while...). There are many issues to consider beyond total worth, including the liquidity of savings versus home equity, tax and tax-sheltered savings, [...]]]></description>
			<content:encoded><![CDATA[<p>In writing this post, I discovered that this question is very common<a href="http://money.msn.com/home-loans/do-not-rush-to-pay-off-that-mortgage-weston.aspx"><sup>1</sup></a><sup>,</sup><a href="http://www.nytimes.com/2010/03/20/your-money/mortgages/20money.html?pagewanted=all"><sup>2</sup></a><sup>,</sup><a href="http://frugaldad.com/2009/02/24/should-i-pay-off-my-mortgage/"><sup>3</sup></a><sup>,</sup><a href="http://www.mymoneyblog.com/thoughts-on-paying-extra-towards-mortgage-principal.html"><sup>4</sup></a>, and that my treatment will be relatively simplistic (I almost didn't post it, but this blog has been dark for a while...). There are many issues to consider beyond total worth, including the liquidity of savings versus home equity, tax and tax-sheltered savings, variability in interest rates, <em>etc</em>.</p>
<p>Consider the problem where a certain amount of monthly income <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> is available to either invest in savings (<em>i.e.</em>, savings or money market account, CD, mutual funds, stocks, other unwieldy financial instruments<a href="http://en.wikipedia.org/wiki/Financial_instrument"><sup>5</sup></a>), or to make an additional payment towards a home mortgage. The relevant quantities are denoted</p>
<ul>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_a9def425f2cc6fbd107b6672115c9bba.gif' style='vertical-align: middle; border: none; ' class='tex' alt="B_l" /></span><script type='math/tex'>B_l</script>, <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_526906eedcebc63130e1a0eaab2ef29c.gif' style='vertical-align: middle; border: none; ' class='tex' alt="B_s" /></span><script type='math/tex'>B_s</script> - Loan or Savings Balance</li>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_44c29edb103a2872f519ad0c9a0fdaaa.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="P" /></span><script type='math/tex'>P</script> - Initial Loan (Principal)</li>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_98018cddae99cf5001f990f39319cd6e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="R_l" /></span><script type='math/tex'>R_l</script>, <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_fa8707b4d8465188531c9cc83374a5ee.gif' style='vertical-align: middle; border: none; ' class='tex' alt="R_s" /></span><script type='math/tex'>R_s</script> - Monthly Interest Rates</li>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_7fc56270e7a70fa81a5935b72eacbe29.gif' style='vertical-align: middle; border: none; ' class='tex' alt="A" /></span><script type='math/tex'>A</script> - Minimum Monthly Mortgage Payment</li>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_8d9c307cb7f3c4a32822a51922d1ceaa.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="N" /></span><script type='math/tex'>N</script> - Number of Payments / Investments</li>
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> - Unallocated Earnings
<li><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_5dbc98dcc983a70728bd082d1a47546e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S" /></span><script type='math/tex'>S</script> - Monthly Savings (<span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_17fcbbd5a733975bae3ebc06867557c8.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S \leq E" /></span><script type='math/tex'>S \leq E</script>)
</ul>
<p>Hence, <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_5dbc98dcc983a70728bd082d1a47546e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S" /></span><script type='math/tex'>S</script> is the portion of <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> that is invested, where the remainder of <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> is used to reduce the mortgage balance. Our problem is to select <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_5dbc98dcc983a70728bd082d1a47546e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S" /></span><script type='math/tex'>S</script>.</p>
<p>The balance formulae for savings and loans are, respectively, <p style='text-align:center;'><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_880eb234a3813c7ebd815ea33e996ad7.gif' style='vertical-align: middle; border: none;' class='tex' alt="\begin{array}{r c l}B_s & = & P_s(1+R_s)^N + S[(1+R_s)^N - 1]/R_s \\ B_l & = & P_l(1+R_l)^N - (A + E - S)[(1+R_l)^N - 1]/R_l\end{array}." /></span><script type='math/tex' mode='display'>\begin{array}{r c l}B_s & = & P_s(1+R_s)^N + S[(1+R_s)^N - 1]/R_s \\ B_l & = & P_l(1+R_l)^N - (A + E - S)[(1+R_l)^N - 1]/R_l\end{array}.</script></p> Derivation of these formulae<a href="http://oakroadsystems.com/math/loan.htm"><sup>6</sup></a> relies on the sum of geometric series<a href="http://en.wikipedia.org/wiki/Geometric_series"><sup>7</sup></a>. For simplicity, we assume that no savings have been accumulated thus far (<em>i.e.</em>, <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_8cb821c777304557c109ce507bebad12.gif' style='vertical-align: middle; border: none; ' class='tex' alt="P_s = 0" /></span><script type='math/tex'>P_s = 0</script>). In this scenario, the total accumulated value is the sum of home equity and savings <p style='text-align:center;'><span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_295236f1aad6f377c82367509e90c501.gif' style='vertical-align: middle; border: none;' class='tex' alt="\begin{array}{r c l}W & = & P_l - B_l + B_s \\ & = & P_l - P_l(1+R_l)^N + (A + E - S)[(1+R_l)^N - 1]/R_l + S[(1+R_s)^N - 1]/R_s \end{array}" /></span><script type='math/tex' mode='display'>\begin{array}{r c l}W & = & P_l - B_l + B_s \\ & = & P_l - P_l(1+R_l)^N + (A + E - S)[(1+R_l)^N - 1]/R_l + S[(1+R_s)^N - 1]/R_s \end{array}</script></p><br />
By factoring <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_5dbc98dcc983a70728bd082d1a47546e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S" /></span><script type='math/tex'>S</script>, we find that <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_71913e452c5091a5116a9ce53a7b4ae4.gif' style='vertical-align: middle; border: none; ' class='tex' alt="W = C + S\{[(1+R_s)^N-1]/R_s - [(1+R_l)^N-1]/R_l\}" /></span><script type='math/tex'>W = C + S\{[(1+R_s)^N-1]/R_s - [(1+R_l)^N-1]/R_l\}</script>, where <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_0d61f8370cad1d412f80b84d143e1257.gif' style='vertical-align: middle; border: none; ' class='tex' alt="C" /></span><script type='math/tex'>C</script> is constant with respect to <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_5dbc98dcc983a70728bd082d1a47546e.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S" /></span><script type='math/tex'>S</script>. Hence, it's optimal to save all of <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> (<em>i.e.</em>,<span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_84d3a33fde552fc6148664c09686965b.gif' style='vertical-align: middle; border: none; ' class='tex' alt="S=E" /></span><script type='math/tex'>S=E</script>) when <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_cb293796e346aa26cf814c7c426c1977.gif' style='vertical-align: middle; border: none; ' class='tex' alt="[(1+R_s)^N-1]/R_s" /></span><script type='math/tex'>[(1+R_s)^N-1]/R_s</script> is greater than <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_66ae0738007d7bac67a4cfc886d5e2a0.gif' style='vertical-align: middle; border: none; ' class='tex' alt="[(1+R_l)^N-1]/R_l" /></span><script type='math/tex'>[(1+R_l)^N-1]/R_l</script>, but to save none of <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> (<em>i.e.</em>, to apply <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_3a3ea00cfc35332cedf6e5e9a32e94da.gif' style='vertical-align: middle; border: none; padding-bottom:1px;' class='tex' alt="E" /></span><script type='math/tex'>E</script> towards mortgage principle) when the opposite is true. What's interesting here, and not immediately intuitive, is that optimality depends only on the related interest rates, but not the mortgage balance!</p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2051/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Eisen and Elsevier have words over bill to end NIH Public Access Policy</title>
		<link>http://biostatmatt.com/archives/2041</link>
		<comments>http://biostatmatt.com/archives/2041#comments</comments>
		<pubDate>Wed, 11 Jan 2012 02:26:54 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[NIH]]></category>
		<category><![CDATA[publisher]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2041</guid>
		<description><![CDATA[This article linked below and the subsequent comments (some apparently from an Elsevier rep.) are interesting. It's not directly related to R, but is related to open source/science philosophy. Elsevier-funded NY Congresswoman Carolyn Maloney Wants to Deny Americans Access to Taxpayer Funded Research]]></description>
			<content:encoded><![CDATA[<p>This article linked below and the subsequent comments (some apparently from an Elsevier rep.) are interesting. It's not directly related to <tt>R</tt>, but is related to open source/science philosophy.</p>
<p><a href="http://www.michaeleisen.org/blog/?p=807">Elsevier-funded NY Congresswoman Carolyn Maloney Wants to Deny Americans Access to Taxpayer Funded Research</a></p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2041/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Two Quotes to Summarize Opposing Positions on “Is Bayes Posterior just Quick and Dirty Confidence?”</title>
		<link>http://biostatmatt.com/archives/2023</link>
		<comments>http://biostatmatt.com/archives/2023#comments</comments>
		<pubDate>Mon, 09 Jan 2012 22:04:46 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=2023</guid>
		<description><![CDATA[A central theme of Don Fraser's article, titled "Is Bayes Posterior just Quick and Dirty Confidence?", was that Bayesian confidence regions have approximate, and sometimes poor frequentist coverage (i.e., the frequency with which a confidence region contains the true parameter value under repeated sampling). Fraser has this warning: The failure to make true assertions with [...]]]></description>
			<content:encoded><![CDATA[<p>A central theme of <a href="http://projecteuclid.org/euclid.ss/1320066918">Don Fraser's article</a>, titled "Is Bayes Posterior just Quick and Dirty Confidence?", was that Bayesian confidence regions have approximate, and sometimes poor frequentist coverage (<em>i.e.</em>, the frequency with which a confidence region contains the true parameter value under repeated sampling).</p>
<p>Fraser has this warning:</p>
<blockquote><p>
The failure to make true assertions with a promised reliability can be extreme with the Bayes use of mathematical priors (Stainforth et al., 2007; Heinrich, 2006). The claim of a probability status for a statement that can fail to be approximate confidence is misrepresentation. In other areas of science such false claims would be treated seriously.
</p></blockquote>
<p>The complaint about coverage of Bayesian confidence regions arises often, I think, because there is a very ingrained notion that correct frequentist coverage is a most desirable quality; that frequentist coverage is a literal statement about a natural phenomenon. Fraser goes further to say that confidence regions with incorrect coverage are misrepresentative, perhaps even fraudulent (<em>i.e.</em>, a thing to be 'treated seriously')!</p>
<p>Of course, frequentist coverage is almost never a literal statement about a natural phenomenon, because statistical models almost never fully reflect the truth. In the sentiment of G.E.P. Box, all reported confidence levels are wrong, but some are useful.</p>
<p>More importantly, the criticism of approximate frequentist coverage is readily dismissed from a Bayesian perspective. In <a href="http://projecteuclid.org/euclid.ss/1320066919">response</a>, Christian Robert had this final comment:</p>
<blockquote><p>
Bayesian credible intervals are not frequentist confidence intervals and thus do not derive their optimality from providing an exact frequentist coverage.
</p></blockquote>
<p>On a side note, Fraser's statement above seems to neglect that frequentist probability is different from Bayesian probability in the context of confidence regions! I wrote on a related topic a few weeks ago: <a href="http://biostatmatt.com/archives/1812">Bayesian vs. Frequentist Intervals: Which are more natural to scientists?</a> Unfortunately though, my understanding is upset again due to Robert's reference to Jaynes: "there is only one kind of probability". How can this be true? Aren't Bayesians clear that the posterior distribution on a parameter is not to be interpreted in a frequentist way? And aren't frequentists clear that confidence regions not to be interpreted in a subjective way?</p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/2023/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>useR! 2012 Simple Abstract Helper</title>
		<link>http://biostatmatt.com/archives/1933</link>
		<comments>http://biostatmatt.com/archives/1933#comments</comments>
		<pubDate>Tue, 03 Jan 2012 21:20:51 +0000</pubDate>
		<dc:creator>BioStatMatt</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[Sweave]]></category>
		<category><![CDATA[useR!2012]]></category>

		<guid isPermaLink="false">http://biostatmatt.com/?p=1933</guid>
		<description><![CDATA[useR! 2012 has issued a call for abstracts! I've extended the WebSweave concept to offer a tool to create simple abstracts online, including those with \LaTeX markup, which may then be submitted at the conference website. Use the following link for the Simple Abstract Helper.]]></description>
			<content:encoded><![CDATA[<p><em>useR! 2012</em> has issued a <a href="http://biostat.mc.vanderbilt.edu/wiki/Main/UseR-2012#Call_for_Abstracts_and_Tutorial">call for abstracts</a>! I've extended the <a href="http://biostatmatt.com/archives/1184">WebSweave</a> concept to offer a tool to create simple abstracts online, including those with <span class='MathJax_Preview'><img src='http://biostatmatt.com/wp-content/plugins/latex/cache/tex_c51d7e23458ca0e7373a8ed6ab56b2b9.gif' style='vertical-align: middle; border: none; ' class='tex' alt="\LaTeX" /></span><script type='math/tex'>\LaTeX</script> markup, which may then be submitted at the conference website. Use the following link for the <a href="http://biostatmatt.com/user-2012-abstract-helper">Simple Abstract Helper</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://biostatmatt.com/archives/1933/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

