Randomization in Characterizing the Subsurface
Current methods for characterizing Earth’s subsurface, such as standard inverse techniques, are not sufficiently accurate to meet the needs of modern applications in the fields of energy exploration, environmental management, and global security. While increasing the quantity of field measurements and robustness of the applied data-/model-analysis methods can improve accuracy, such approaches can be computationally impractical for large data sets and complex site conditions. Therefore, there is a need to develop economically-feasible and robust computational methods while maintaining accuracy. For example, in-field drilling for geothermal operations may yield high failure rates, resulting in unacceptably high costs; errors and/or large uncertainties in the estimated subsurface characteristics are the main impediment to the successful siting of an in-field well. This problem is not uniquely geothermal. Accurate characterization of uncertain subsurface properties is also critical for monitoring storage of carbon dioxide, estimating pathways of subsurface contaminant transport, and supervising ground-based nuclear-explosion tests.
We have developed various methods to characterize the subsurface, including efficient computational strategies to identify subsurface permeability given a set of hydraulic heads, as shown in Figure 1, and a data-driven subsurface geological feature detection approach using seismic measurements, as shown in Figure 2. A major challenge for many subsurface applications is the large number of observations and high feature dimensionality.
Randomized matrix algorithms—which aim to construct a low-rank approximation of an input matrix—have received a great deal of attention in recent years. The low-rank approximation, often called a matrix “sketch,” is usually the product of two smaller matrices, which yields a good approximation that represents the original output’s essential information. Therefore, one can employ a sketching system as a surrogate for the original data to compute quantities of interest. We have employed randomization techniques to solve various large-scale computational problems. Here we provide examples to demonstrate two major applications in solving real-world subsurface problems.

Randomized Subsurface Permeability Estimation
A porous medium’s permeability is a physical quantity needed to predict flow and transport of fluids and contaminants in the subsurface. The permeability’s estimation is often posed as a regularized inverse problem
where
The solution to
where
One may obtain
However, solving
Our approach aims to construct a sketching matrix, the elements of which are drawn randomly from a Gaussian distribution. We then replace the data
and
Subsurface Geological Feature Detection Using Randomized Data-Driven Methods
Seismic waves are more sensitive to the acoustic/elastic impedance of the subsurface than other geophysical measurements (see Figure 2). Hence, seismic exploration has been widely used to infer heterogeneities in media impedance, which indicate geologic structures.
![<strong>Figure 2.</strong> Diagram of the data-driven procedure to learn geologic features from seismic measurements. Image courtesy of [2] and [3].](/media/zmhptmr0/fig1.jpg)
Analyzing and interpreting seismic measurements for identifying prospective geological features is challenging. The difficulties arise from the processing of large amounts of seismic data and the incorporation of subjective human factors. Different geologic features play different roles in characterizing subsurface structure. In particular, identifying geological fault zones is essential to many subsurface energy applications. In carbon sequestration, potential leaks of stored carbon dioxide can create geologic faults, so knowing fault locations is necessary to monitor carbon dioxide storage. We have developed a novel data-driven geological feature detection method and successfully applied it to seismic measurements [2, 3], as illustrated in Figure 2. Both historical and simulated seismic data are fed into learning algorithms. A detection function
Suppose one has
where
where
However, the direct utilization KRR prediction in
The Nyström method computes a low-rank approximation
![<strong>Figure 3.</strong> Illustration of the Nyström approximation. Image courtesy of [2] and [3].](/media/kzeoyrk1/fig3.jpg)
With the low-rank approximation obtained via the Nyström method, we can efficiently calculate an approximated solution
where the latter equality follows from the Sherman-Morrison-Woodbury matrix identity. It is worthwhile mentioning that the
Acknowledgements: This work was co-funded by the U.S. Department of Energy (DOE) Office of Fossil Energy’s Carbon Storage program, Los Alamos National Laboratory Environmental Programs, and the DOE Office of Science (DiaMonD project: An Integrated Multifaceted Approach to Mathematics at the Interfaces of Data, Models, and Decisions, Grant #11145687).
References
[1] Lin, Y., Le, E., O’Malley, D., Vesselinov, V., & Bui-Thanh, T. (2017). Large-Scale Inverse Model Analyses Employing Fast Randomized Data Reduction. Wat. Resc. Res, 53(8), 6784-6801.
[2] Lin, Y., Wang, S., Thiagarajan, J., Guthrie, G., & Coblentz, D. (2017). Efficient Data-Driven Geologic Feature Detection from Pre-stack Seismic Measurements using Randomized Machine-Learning Algorithm. Preprint, arXiv:1710.04329.
[3] Lin, Y., Wang, S., Thiagarajan, J., Guthrie, G., & Coblentz, D. (2017). Towards Real-Time Geologic Feature Detection from Seismic Measurements using a Randomized Machine-Learning Algorithm. In SEG Technical Program Expanded Abstracts 2017 (pp. 2143-2148). Houston, TX: Society of Exploration Geophysics.
About the Authors
Youzuo Lin
Staff scientist, Los Alamos National Laboratory
Youzuo Lin is a staff scientist in the Earth and Environmental Sciences Division at Los Alamos National Laboratory.
Daniel O’Malley
Staff scientist, Los Alamos National Laboratory
Daniel O’Malley is a staff scientist in the Earth and Environmental Sciences Division at Los Alamos National Laboratory.
Velimir V. Vesselinov
Staff scientist, Los Alamos National Laboratory
Velimir Vesselinov is a staff scientist in the Earth and Environmental Sciences Division at LANL and a principle investigator of several Department of Energy-funded projects related to environmental management.
George D. Guthrie
Geochemist, Los Alamos National Laboratory
George D. Guthrie is a geochemist in the Earth and Environmental Sciences Division at Los Alamos National Laboratory.
David Coblentz
Staff scientist, Los Alamos National Laboratory
David Coblentz is a R&D Manager and staff scientist in the Earth and Environmental Sciences Division at Los Alamos National Laboratory.
Stay Up-to-Date with Email Alerts
Sign up for our monthly newsletter and emails about other topics of your choosing.