Monday, December 15, 2008

Monbiot and Birol

George Monbiot meets ... Fatih Birol on video for 12 minutes.

Accompanying story here.

Sunday, November 09, 2008

Comprehensive Oil Depletion Model Life-Cycle

Setting the Stage
In light of this fact, it should be no surprise that the possibility that world oil production will soon reach a peak and then inexorably decline is a subject of great interest and intense debate. As noted by Dr. Greene, the “pessimists,” a somewhat pejorative label given to those who are convinced that the oil peak is imminent and that its consequences will be dire, assert that world oil supply is chiefly determined by the geology of oil resources.
The term "pessimist" has no meaning anymore. Can you still call a person who questioned the run-up of hedge funds and derivatives, not to mention the stock market for the last 10+ years, a pessimist without incurring any sense of shame? All the models in that world got built to advance one motive -- that of profit, brought about by no small part greed. No one really cared whether they made sense in some theoretical framework, and why should they, since human nature would constantly batter down the model's premises in search of escape clauses. This occurred all in the nature of one-upping the next guy ... in a zero-sum game of zero-summed optimism.

So I have general agreement with "Dr.Greene" in that geology determines the oil depletion arc almost completely. I would even take it a step further and also claim that probability and statistics plays an even greater role. The following comprehensive framework, that has essentially described the information content of this blog the last few years, maps out what I have tried to achieve. Don't confuse this with any kind of wacked-out psychohistory-styled prediction of the future though. Although it shares some grand aspects of a statistics-based forecasting outlook, I think I know enough when to give it a rest and not to try to predict collective human actions.



I don't consider the math behind the models that difficult to understand. The two darkened bubbles above contain the mathematical foundation for parts of the discovery process along with the oil production model. Everything fits together like a glove (IMHO and after several years of effort) and the interpretations of the model replace a longstanding set of heuristics that in the past have served to cobble together a rather poorly-formed understanding of the aggregated oil production life-cycle. The last piece in the puzzle occurred in the past few months, as I tried to decipher a model for field-size distributions.

As I spent most of my time working my way backward in the oil life-cycle timeline (right to left in the above figure), I can't provide a linear description of the comprehensive model's development. In the interim, the following bullet list of links provides reference points for entry into a mind-blowing range of pessi-optimism or opto-pessimism ... depending on your point of view.
If you enjoy the art of rhetoric more than the dialectic, beware. For many, this stuff brings on MEGO, but for a few die-hards you can get your math-porn for free.

Saturday, November 08, 2008

Investment in Knowledge

Nate Hagens over at TOD posted a brilliant comment in response to Exxon denial:
I am beginning to believe it is an advantage to NOT work in the oil industry to understand oil. These people have been wrong, are wrong, and are about to be VERY wrong with their understanding of what peak oil means. Peak Oil has many definitions, but the most common is the all time high in world annual production of crude oil. Resources have little to do with it. (There are probably 10 million earthworms on my property -but even with a team of people and the best equipment I might only get a fraction of them). Higher prices and higher technology have little to do with Peak Oil, which has to do with cheap, reliable flow rates. There is not the slightest evidence that market theories (or activities) has helped find any more oil and gas (in the United States) since price-induced drilling increases had essentially zero impact on the production (or finding) of oil and gas.

Lets scrap the word 'peak oil' for the moment. To the economists and cornucopians at Exxon, the API, the EIA, etc. I ask these questions:

1)Do you expect oil production costs to get cheaper over time?

2) Have we past the point of cheap oil? (which is what matters - who cares if we can get an extra 20 mbpd if it takes more energy, more steel, more water and costs $500 per barrel)

3)Will the energy and other resources you use to procure oil and natural gas increase or decrease in the future?

4)Irrespective of resource or reserves, what will be the highest, reasonably low priced (say under $80 cost), FLOW rate that you can consistently provide that is not subject to geopolitical disruptions at the margin? (i.e. is there a perpetual cushion in case something goes wrong)

5)What is the error band and confidence interval you assign to your above answers? Are you willing to stake the future of industrial civilization on your answers? Even if there is a 5% chance that the resources you see translate to regular, cheap, flow rates of high quality oil, that is too big of a risk for society to take (and I think it is much higher than that).

You people are asking the wrong questions, because you've been focused on what you believe is the most important aspect of the problem - where IS the oil. That is a small part of the many more important questions, yet the group think and myopia has created an enormous blindspot. A couple months ago, if I would have told you the Federal Reserve would DOUBLE its balance sheet since the end of September, would anyone have believed me? Well, they did. Rules and facts change. Correlations that worked in the past are now uncorrelated. What was uncorrelated in the past is now completely correlated. Thats why economics isn't science. Its based on a moving target. Economists at the oil companies are trained to think in resources and price, not in energy costs and externalities. They will not see this Black Swan until it bites them in the ass.

I don't believe in any kind of investment anymore besides an investment in knowledge. Forget investing in the stock market. Forget about investing in new ways to find oil and that "Drill, Baby, Drill" crap. Forget investing in any kind of investment, except for the one between our collective two ears.

Wednesday, November 05, 2008

USA Field Size Distribution

This dated paper (circa 1986) on USA oil field size has a few interesting statistics. I don't have any of the charts, but the authors roll up a few of the numbers from the curves.
The distribution of about 14,000 United States oil fields (Figure 2)--a partial sample of those in the lower 48 states--illustrates the importance of the larger fields. The sample includes almost all larger fields as known about 1970 and excludes many tiny fields as well as all of the more recent discoveries. The lower dotted line is made of 13,985 points representing the fields ordered according to increasing reserves size. Only 440 fields, or about 3%, are major ones larger than 50 million bbl. The upper curve tracks the percentage of the total oil volume occurring in fields greater than each size. From this curve, we read that the major fields, constituting only 3% of the total number, contain 80% of the total oil. Obviously, in this type of distribution one can account for the bulk of the oil by assessing the larger field possibilities only.

Selecting an effective minimum field-size cutoff is very important, because it affects every major factor in the assessment--the prospects to be counted and the success and risk levels, as well as the average field size. Normally, the minimum size is taken at or just below the assumed economic minimum for the area. This approach ensures that all prospects of real interest are included. It also avoids getting bogged down in hundreds or thousands of fields that are inconsequential to early exploration stages. Furthermore, the comparative data base for assessing sub-economic fields is very weak, as the true sizes of these fields have rarely been scaled. If desired, one can assess the small fields by statistical extrapolation or by estimating a lump-sum proportion from a volume curve like that of Figure 2.

Economic limits always truncate the lower ends of observed field-size distributions (Arps and Roberts, 1958; Kaufman et al, 1975; Grender et al, 1978; Drew et al, 1982; Vinkovetsky and Rokhlin, 1982). In nature's distribution, numbers of deposits probably increase progressively in successively smaller sizes down to droplets and molecules; such a distribution is not lognormal. But we deal exclusively with artificially truncated distributions whose plots almost invariably curve upward near the low-side truncation point (upper curve, Figure 3). Our United States distribution (lower curve, Figure 2) has no data below 1,000 bbl, and many of the data points below 10,000 bbl, where the graph ends, are questionable. If the plot were continued to the left, it would ultimately curve upward at t e point of economic truncation beyond which there are no data.

We use the computational convenience of the lognormal distribution, appropriately truncated, but would not argue that this scheme is better or worse than other computational ones for strongly right-skewed distributions that have many more little fields than big fields. Some investigators (e.g., Ivanhoe, 1976; Folinsbee, 1977; Coustau, 1980) plot field size bilogarithmically against rank order. For our assessment approach, we must normalize field numbers at this stage by plotting "percent greater than" against log size. Depending on purpose and data, we may express field size as recoverable volumes of oil or of gas, or of oil plus gas on an energy-equivalent basis.

These numbers (the 50Mb and the less than 0.1Mb) fill in the following points on the previous post.


The authors state that 3% of the highest rank of fields contain 80% of the oil, which includes rank up to 440 in this chart. Or this means that 20% of the oil resides in the lowest 97% of the rank.
Up/U = (ln(1+Size/c) - 1/(1+Size/c)+ 1)/(ln(1+Max/c) - 1/(1+Max/c) + 1)
I would like to make sense of this but as I don't have the original charts, I am guessing as to whether the 80% makes sense. With the Dispersive Aggregation model for the extra 21,000 fields reported, the amount of oil contributed to those above 50Mb has dropped to 50%.






YearSizeFraction of Total
#Fields
1986>50Mb0.8013,985
Today>50Mb0.5134,969
In the last 20 years, we probably have gained a lot of extra mileage from the low volume fields, but I can't say for certain without filling in the historical data points. If someone can get a hold of this Baker paper and post some of the charts, it would help immensely.

These authors also believed that the field size distribution followed a log-normal. The Dispersive Aggregation mimics the general shape of the log-normal under certain regimes, especially under a wide variance, while at the same time generating the heavy Pareto-like tail (i.e. 1/xp) that much of the data seems to indicate.


The following chart shows a cumulative field size distribution from the Canadian Williston Basin. The referenced article shows how they use a log-probability chart to map onto a log-normal curve; I pasted on a set of points corresponding to Dispersive Aggregation and as you can see, it mimics the behavior of a log-normal distribution.
Free Image Hosting at www.ImageShack.us




Baker, R.A., H. M. Gehman, W. R. James, D. A. White, 1986,
Geologic Field Number and Size Assessment of Oil and Gas Plays,
in Oil and Gas Assessment: Methods and Applications: AAPG Studies in Geology
No. 21, p. 25 – 31.

Tuesday, November 04, 2008

Franken

Update: RECOUNT! The only reason I put up this post was that I knew it was going to be close, and Coleman is pure sleaze. I did an absentee and noticed that my ballot had ink already partially filled in the Senate column for one of the minor candidates. For once, I'd like to know if my vote got counted....



For fellow Minnesotans reading this, vote Al Franken for Senate.

Norm Coleman is a corrupt, incompetent bureaucrat who has wasted precious intellectual resources pursuing the exaggerated "Oil for Food scandal" leading up to the Iraq war. Prior to the oil crunch this summer, Oil for Food was the closest that the Republicans ever got to even talking about the global oil situation, and it turned out to be a complete smoke-screen to cover their own interests. As the previous ranking member of the Permanent Subcommittee on Investigations, we can all blame Norm Coleman for running interference on one of the most effective head-fakes perpetrated in avoiding the truth. So much for investigatin' skills there, Norm

For fellow Wright County folks, I have no doubt you will say no to the imbecilic Michele Bachmann, in favor of the El-Train.

Wednesday, October 29, 2008

Significant, no hyperbole

Does anybody else even look at this stuff? Here we have one of the biggest issues facing the world, that of peak oil, and we don't even know how to analyze the data. Sigh. Oh well, here we go once again.

Let us look at the Dispersive Discovery model and the relationship describing cumulative reserve growth for a region:
DD(t) = 1 / (1/L + 1/kt) => Ureserve(t)
where t=time from the initial production. It gets a bit tricky to nail down the initial value for growth, as that has a big influence on the ultimate growth factor. The curve basically looks like this:


From noting in the last post that this same dependence can occur for field sizes, I realized that an interesting mapping into reciprocal space makes these curves a lot easier to visualize and to extrapolate from. So instead of plotting t against U, we plot 1/t against 1/U
1/U(t) = 1/L + 1/kt
On a linear-linear x-y plot where x maps into 1/t and y into 1/U, the linear curve looks like this:


For field size distributions, it looks like this on a log-log plot. This shows up clearly as a straight line over the entire range of the reciprocal values for the variants, if you pull the constant term into one of the two variants. This works out very well for size distributions if we scale the values to an asymptotic cumulative probability of 1.


Laherrere has long referred to "hyperbolic" plots in fitting to creaming curves. If you Google for hyperbolic Laherrere, you will find several PDF files where he describes how well he can match the growth temporal behavior to one or more "hyperbolic" curves. However, I can find no mention of Laherrere's description or derivation of the hyperbolic other than its use as a heuristic. I posted about Laherrere's use of hyperbolic curves before without understanding what he meant by them. Even the thoroughly discredited cornucopian Michael Lynch could not find an explanation:
No explanation is given for the 'hyperbolic model' or why ordering by size is more appropriate than by date of discovery.
I always assumed that a hyperbolic function either meant a slice through a conic section or this type of dependence:
y = 1 / x + c
However, none of these two behaviors really match what we see. But then a coconut fell on my head and I finally realized Laherrere probably meant the alternate version that I show in the first equation above.
1 / y = 1 / x + c
The following graph shows a typical Laherrere creaming curve analysis, where he fits to a couple of hyperbolic functions. Note that he refers to "hyperbola" in the legend. As the mellon-colored curve I plot the Dispersive Discovery cumulative . I had to slide it off Laherrere's hyperbola curve slightly since the two match exactly, which means that you couldn't tell the two apart and one curve would obscure the other.

I have no doubt that Laherrere's hyperbolas map precisely into the linear Dispersive Discovery curves. I just find it unfortunate that it has taken this long to figure this key piece of the puzzle out. So we have turned the hyperbolic fit heuristic into a model which comes about from a well understood physical process, that of Dispersive Discovery.

On another positive note, I think the reciprocal fitting (Dispersion Linearization) may become just as popular as Hubbert Linearization. From what I have gathered, people seem to appreciate straight lines more than curves. Herewith, from a classic Simpson's episode:
Professor Frink: [drawing on a blackboard] Here is an ordinary square....

Police Chief Wiggum: Whoa, whoa - slow down, egghead!
I am currently reading one of Professor Deirdre McCloskey's rants against non-scientific method mumbo-jumbo and feel a bit better at fighting the good fight against using arbitrary heuristics and qualitative analysis. Yet, in the long run, will anyone really care unless you can simplify it to two points on a line segment?

Friday, October 24, 2008

Estimating URR from Dispersive Field Size Aggregation

This post continues from the analysis of oil field size distribution from a few days ago. That discussion ended with questions regarding the utility of the analysis for arbitrary regions. It seemed to work well for North Sea oil but eyeballing for Mexico, the Parabolic Fractal Law looked a better empirical fit.

I did not talk much about the statistics of the rank histograms in the previous post. To remedy that I ran a few Monte Carlo simulations to determine what noise we can expect on the histograms. In particular, I went back to the North Sea data. The figure below shows a Monte Carlo run for the Dispersive Aggregation model where I sampled from the inverted distribution with P acting as a stochastic variate:
Size=c*(1/P-1) where P=[0..1]
For a run of 200 samples, the results look like:

One can linearize this curve by taking the reciprocals of the variates and replotting. Note the sparseness of the endpoints which means that random fluctuations could change the local slope significantly (which has big implications for the Parabolic Fractal Model as well).

Plotting the MC data simulated for 430 point on top of the actual North Sea data and we get this:

The following figure gives a range for the single adjustable parameter in the model. For the North Sea oil, I replotted using the MaxRank and two values of C which bounded the maximum value.



The parameter C acts like a multiplier so it essentially moves the curves up or down on a log-log plot.


One can try to estimate URR from the closed-form solution but as I said before, the lack of a "top" to the data makes it unreliable. The actual distribution goes like 1/Size so that this integrates to the logarithm of a potentially large number -- in other words, it diverges. So unless one can put a cap on a maximum field size, ala the PFM's curvature, the URR can look infinite. From the model's perspective, one can emulate this behavior by not allowing a narrow window of probability for those large reservoir sizes.

In terms of geological time, we have one finish line, corresponding to the current time. But the growth lifetimes forr the dispersion to occur correspond roughly to all the points between now and the early history of oil formation some 300 million years ago. So we have to integrate to average out these times.
Cumulative (PDF(Size)) =
Integral (PDF(Size)dSize) from 0 .. Now =
Integral (PDF(kt)dt) from 0 .. Now
where the value of Now you can consider as roughly 300 million years from the start of the oil age. Small values of T correspond to the start of dispersion at longer times ago and higher values result in values closer to the present (Now) time. The number T itself scales proportionately to the rank index on a field distribution plot if dispersion proceeds more or less linearly with time (kT ~ Size). Also, a rank value of 1 corresponds to the largest value on a rank histogram plot from which can estimate Maximum Field Size. Given a mature enough set of field data, this provides close to the ceiling for where fields cannot aggregate.

We essentially blank-out a probability window for field sizes above a certain value. This gives the following from the Dispersion relation:
P(Size) = K*Integral (C/(Size+C)^2 dSize) from Size = [0..L]
P(Size) = Size*(L+C)/(Size+C)/L


... inverting

Size = C*P/(1+C/L-P)
where P=[0..1]
The following set of curves shows the dispersive aggregate growth models under the conditions of a maximum field size constraint, set to L=1000.


Converting this graph to a rank histogram and you can notice an interesting stretching going on. Since we do have a constraint on field size, we can calculate an equivalent URR for the area under the curve.


We need to use the rank histogram to get the counting correct. Then the URR derives to:
URR = MaxRank * C * ((1+C/L)*ln (1+L/C) - 1)
for most cases, this approximates to:
URR ~ MaxRank * C * (ln (L/C) - 1)
Note that the URR has a stronger dependence on the parameter C than the maximum field size L , which has a weak logarithmic behavior. I will discuss the case of the USA further down this post but keep in mind that Americans have more oil fields by far than anyone else in the world, i.e. a huge MaxRank, yet our URR does not swamp out everyone else.

To test the model against reality, I retrieved field size data from Khebab's post on oil field sizes and Laherrere's paper on "Estimates of Oil Reserves".
  • North Sea (see above)
  • Mexico
  • Norway
  • World (minus USA/Canada)
  • West Siberia
  • Niger
Plus one estimate
  • USA
This chart from Laherre shows data from Mexico superimposed with the Dispersive Aggregation model (no field size constraint). Note that the field Canterell may fall in the predicted path and not form some sort of outlier (as some have suggested due to it meteorological origin).
Free Image Hosting at www.ImageShack.us

For Norway (courtesy of Khebab and Laherrere) we get the following two curves with data separated in time by several years. Note how the Maximum Rank shifts right as the value of C grows with time. Actually this shows that we may have some difficulty in separating out the decision of not developing smaller fields with an actual physical limit on the number of small fields that we count as production-level discoveries. (Caution: The values for C are in Gb, so they have to be mutiplied by 1000 to match the other C's in this post)

Free Image Hosting at www.ImageShack.us
Old data

Free Image Hosting at www.ImageShack.us
Recent data

The World data plot (excluding USA and Canada) from Laherrere does not collect rank info from the smaller oil fields, so the vertical asymptote shown here gives a prediction of a Maximum Rank, approximately 9000 fields worldwide.

This gives a range in URR's from 1100 Gb to 1850 Gb, for values of C from 15 to 25 and a Maximum Field size of 250 Gb. I estimated the MaxRank for this model from Robelius' thesis.
An article by Ivanhoe and Leckie (1993) in Oil & Gas Journal reported the total amount of oil fields in the world to almost 42000, of which 31385 are in the USA. According to the latest Oil & Gas Journal worldwide production survey, the total number of oil fields in the USA is 34969 (Radler, 2006). The number of fields outside the USA is estimated to 12500, which is in good accordance with the number 12465 given by IHS Energy (Chew, 2005). Thus, the total number of oil fields in the world is estimated to 47 500.
From Khebab, the PFM gives a low end estimate ignoring the missing parts of the rank histogram:
Using his (Laherrere's) parameters, we can compute a world URR (excluding the US and Canada, conventional oil) equals to 1.250 Trillions of Barrels (Tb) without considering oil fields with sizes below 50 Mb.
This chart from West Siberia bins histogram data on a linear plot.



Niger Delta data does not work very well at all. This could potentially work well as a candidate for constrained reservoir sizes, yet we can not rule out the possibility that some large fields have avoided discovery thus far.
Free Image Hosting at www.ImageShack.us

I haven't found a field size distribution yet for the USA alone, but I generated the following figure as a prediction. I used a maximum rank of 34500 from Robelius's thesis and came up with two curves, with one assuming a maximum field size of 10 Gb (lower curve). The latter corresponds to a URR of 185 Gb. If I used C=0.7 and Max Field of 15 Gb, then I get URR pf 217 Gb. Ideally, I would like to get data from the USA (fat chance) to see how closely the Dispersive theory will agree with such a large (34,500) statistical sample.



Overall, most of the characteristic size (C) parameters for all the field size distribution curves fall in the range of 15 to 30 Mb (Siberia at 44), except for the USA which looks definitely less than 1 Mb. What exactly does this mean? For one, it means that the USA has a much higher fraction of smaller oil fields than the rest of the world. Is this actually due to more resources invested into prospecting for smaller oil fields than the rest of the world? Or is it because the USA has a physical preponderance of smaller oil fields? I don't know. Yet, the latter does make some sense considering how much more reserve growth that the USA shows than the rest of the world (and the number of stripper wells we have). Slower reserve growth occurs for exactly the same reason -- slower relative dispersion in comparison to distance involved -- that it does for dispersive aggregation. After all, I constructed the underlying model in identical ways, substituting natural discovery in Dispersive Aggregation for man-made discovery in Dispersive Reserve Growth.

You can well ask why the curve nosedives so steeply near the maximum rank. It really only looks that way on a log-log plot. Actually the distribution flattens out near zero and this creates a graphical illusion of sorts. The dispersion model says that up to a certain recent time in geological history, many of the oil fields have not started dispersing significantly -- at this point the slow rates have not yet made their impact and the fast rates haven't had any time to evolve. This manifests as an unknown distribution of sizes for oil fields before this point. (If you plot a population's yearly income on a rank histogram you will see this same effect, in this case due to a similar truncation due to a slow income growth early in a career). The USA field essentially has a much slower dispersive evolution than the rest of the world, so we have a much higher fraction of small fields that have not aggregated.

The Dispersive Field Size Aggregation falls into the Dispersion Theory category of models that seem to have a high degree of cohesion and connectedness. It looks like we can actually connect the dots from dispersive field sizes to the Logistic shape of the Hubbert Peak since the underlying statistical fundamentals have much commonality in terms of temporal and spatial behaviors. For now I can't find a derivation of the Parabolic Fractal Law, which to me looks like a heuristic. I always base my observations on a model. I don't lock-step believe in heuristics, mainly because I have a perhaps unhealthy obsession with understanding why a heuristic works at all (review my railing against the "derivation" of the Logistic that I have frequently written about, until I figured it out to my satisfaction). By definition, a heuristic does not have to explain anything, it just has to describe the results. And describing the results in a mathematical equation does not cut it for me. For all I know, all the Wall Street quantitative analysts (the "quants") have based all their derivative and hedge fund "models" on heuristics -- and look a where that has got us.

In my mind, Dispersive Aggregation makes a lot of sense and it seems to fit the data. I smell another TOD post brewing.