Version:

Full-Waveform Inversion

Part 4 - Iterations

Copy
%matplotlib inline
import numpy as np
import os
import salvus.namespace as sn
SALVUS_FLOW_SITE_NAME = os.environ.get("SITE_NAME", "local")
p = sn.Project(path="project")
The inverse problem is a similar entity like all the other configuration objects. We want to be able to keep track of all the individual steps, and avoid unnecessary repetitions of the same task. Fortunately, SalvusProject takes care of data management and book keeping.
We need to specify the prior, which is just the SimulationConfiguration object of the initial model we created above. Furthermore, we need to specify all possible events that we might consider during the inversion. This could be a subset of events defined in the project, and we could add more events later on. Together with the events, we need to pass the observed data. Because we created it synthetically, this is also just a SimulationConfiguration object. The remaining parameters specify which parameters to invert for (VP and RHO), what misfit functional to use, preconditioner and descent method, and where to run the simulations.
p += sn.InverseProblemConfiguration(
    name="my_inversion",
    prior_model="initial_model",
    events=p.events.list(),
    mapping=sn.Mapping(scaling="absolute", inversion_parameters=["VP", "RHO"]),
    preconditioner=sn.ConstantSmoothing({"VP": 0.01, "RHO": 0.01}),
    method=sn.TrustRegion(initial_trust_region_linf=10.0),
    misfit_configuration="L2",
    wavefield_compression=sn.WavefieldCompression(
        forward_wavefield_sampling_interval=10
    ),
    job_submission=sn.SiteConfig(
        site_name=SALVUS_FLOW_SITE_NAME, ranks_per_job=4
    ),
)
Instead of monolithic inversions and a linear flow of iteration, SalvusOpt uses a tree-based framework of iteration. At any point during the inversion, you can branch off, modify the settings, and run one or more streams simultaneously.
Ready for our first iteration? Without specifying any additional information, all parameters will be inherited from the InverseProblemConfiguration.
p.inversions.add_iteration(inverse_problem_configuration="my_inversion")
[2025-01-09 22:04:47,664] INFO: Adding new iteration #0.
True
Many steps within a single iteration involve expensive simulations, e.g., for computing misfits or adjoint simulations to compute gradients. In order to be able to closely monitor the progress, SalvusOpt steps through an iteration, and automatically dispatches simulations whenever necessary. The function resume will return whenever SalvusOpt is waiting for other tasks to finish first. Calling it several time, will step through the iteration in sequence.
p.inversions.resume(
    inverse_problem_configuration="my_inversion",
)
[2025-01-09 22:04:47,689] INFO: Resuming iteration #0.
Current stage: initialize
[2025-01-09 22:04:47,689] INFO: 1 new tasks have been issued.
[2025-01-09 22:04:47,690] INFO: Processing task `misfit_and_gradient`
[2025-01-09 22:04:48,016] INFO: 
Iteration 0: Number of events: 5	 chi = 0.018681064754203797	 ||g|| = 0.022700663670338374
pred = ---	ared = ---	norm_update = ---	tr_radius = ---
[2025-01-09 22:04:48,017] INFO: 1 new tasks have been issued.
[2025-01-09 22:04:48,017] INFO: Processing task `preconditioner`

[2025-01-09 22:04:48,208] INFO: Some tasks of iteration #0 are still running. Please check again later.
p.inversions.resume(
    inverse_problem_configuration="my_inversion",
)
[2025-01-09 22:04:48,239] INFO: Resuming iteration #0.
Current stage: compute_trial_model
[2025-01-09 22:04:48,239] INFO: Processing task `preconditioner`
[2025-01-09 22:04:48,285] INFO: Some tasks of iteration #0 are still running. Please check again later.
p.viz.nb.iteration(
    inverse_problem_configuration="my_inversion", iteration_id=0
)
If you feel confident and don't want to be bothered with every single task, you can also tell SalvusOpt to run an entire iteration at once. Note the parameter timeout_in_seconds, which will force the cell to return even if the iteration has not been completed yet, and there might still be a few simulations running in the back.
Again, you can execute the cell several times or mix it with calls to the previous one until the iteration is complete.
p.inversions.iterate(
    inverse_problem_configuration="my_inversion",
    timeout_in_seconds=360,
    ping_interval_in_seconds=10,
)

p.viz.nb.inversion(inverse_problem_configuration="my_inversion")
[2025-01-09 22:04:48,899] INFO: Resuming iteration #0.

[2025-01-09 22:04:48,900] INFO: Processing task `preconditioner`
[2025-01-09 22:04:48,937] INFO: Some tasks of iteration #0 are still running. Please check again later.
[2025-01-09 22:04:58,939] INFO: Processing task `preconditioner`
[2025-01-09 22:04:59,064] INFO: 1 new tasks have been issued.
[2025-01-09 22:04:59,065] INFO: Processing task `misfit`
[2025-01-09 22:04:59,267] INFO: Submitting job array with 5 jobs ...

[2025-01-09 22:04:59,359] INFO: Launched simulations for 5 events. Please check again to see if they are finished.
[2025-01-09 22:04:59,360] INFO: Some tasks of iteration #0 are still running. Please check again later.
[2025-01-09 22:05:09,373] INFO: Processing task `misfit`
[2025-01-09 22:05:10,956] INFO: 
old misfit control group: 0.018681064754203797
new misfit control group: 0.007439779782203874
predicted reduction control group: -0.009534746479370205
actual reduction control group: -0.011241284971999922
5 out of 5 event(s) improved the misfit.
[2025-01-09 22:05:10,956] INFO: 
Model update accepted.
[2025-01-09 22:05:10,957] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:10,957] INFO: Processing task `finalize_iteration`
[2025-01-09 22:05:10,981] INFO: Succesfully completed iteration #0.
[2025-01-09 22:05:10,983] INFO: Adding new iteration #1.
There is still quite a strong imprint of the source locations, but we start moving into the right direction. The high velocity inclusion in the top part also shows up in the density update.
The first update gave us confidence in the setup. For the moment, our work here is done. Let's run a few more iterations, lean back and wait for the results.
We just loop over the iterate function and perform 3 more model updates. Intermediate data can pile up quickly during an inversion. Each iteration computes waveforms and gradients for a list of events and creates additional auxiliary files for preconditioning and approximating the curvature. When running low on disk-space it is thus advisable to remove those files using the function delete_disposable_files. It will automatically clear data from waveforms, gradients and auxiliary files for completed iterations, which are no longer needed to continue with the inversion.
This operation is safe in the sense that those data can always be recreated with the provenance information about the model and the simulation. Note, however, that recomputing would involve additional simulations.
for i in range(2):
    p.inversions.iterate(
        inverse_problem_configuration="my_inversion",
        timeout_in_seconds=360,
        ping_interval_in_seconds=10,
    )
    p.inversions.delete_disposable_files(
        inverse_problem_configuration="my_inversion",
        data_to_remove=["auxiliary", "gradients", "waveforms"],
    )
[2025-01-09 22:05:11,942] INFO: Resuming iteration #1.

[2025-01-09 22:05:11,943] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:11,943] INFO: Processing task `gradient`
[2025-01-09 22:05:12,149] INFO: Submitting job array with 5 jobs ...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...

[2025-01-09 22:05:12,207] INFO: Launched adjoint simulations for 5 events. Please check again to see if they are finished.
[2025-01-09 22:05:12,209] INFO: Some tasks of iteration #1 are still running. Please check again later.
[2025-01-09 22:05:22,213] INFO: Processing task `gradient`
[2025-01-09 22:05:22,347] INFO: 5 events have already been submitted. They will not be submitted again.
[2025-01-09 22:05:22,918] INFO: 
Iteration 1: Number of events: 5	 chi = 0.007439779782203875	 ||g|| = 0.009973572871663063
pred = -0.009534746479370205	ared = -0.011241284971999922	norm_update = 0.9389411599540233	tr_radius = 0.9389411625078163
[2025-01-09 22:05:22,927] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:22,928] INFO: Processing task `preconditioner`

[2025-01-09 22:05:23,010] INFO: Some tasks of iteration #1 are still running. Please check again later.
[2025-01-09 22:05:33,036] INFO: Processing task `preconditioner`
[2025-01-09 22:05:33,164] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:33,164] INFO: Processing task `misfit`
[2025-01-09 22:05:33,206] INFO: Submitting job array with 5 jobs ...

[2025-01-09 22:05:33,295] INFO: Launched simulations for 5 events. Please check again to see if they are finished.
[2025-01-09 22:05:33,296] INFO: Some tasks of iteration #1 are still running. Please check again later.
[2025-01-09 22:05:43,335] INFO: Processing task `misfit`
[2025-01-09 22:05:44,853] INFO: 
old misfit control group: 0.007439779782203874
new misfit control group: 0.005660667615804366
predicted reduction control group: -0.0009140965914723943
actual reduction control group: -0.0017791121663995081
5 out of 5 event(s) improved the misfit.
[2025-01-09 22:05:44,853] INFO: 
Model update accepted.
[2025-01-09 22:05:44,854] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:44,854] INFO: Processing task `finalize_iteration`
[2025-01-09 22:05:44,918] INFO: Succesfully completed iteration #1.
[2025-01-09 22:05:44,920] INFO: Adding new iteration #2.
[2025-01-09 22:05:44,925] INFO: ... searching for obsolete files in project/INVERSIONS/my_inversion/00000
[2025-01-09 22:05:44,928] INFO: ... searching for obsolete files in project/INVERSIONS/my_inversion/00001
[2025-01-09 22:05:45,040] INFO: Freed up 6.0 MB of space.
[2025-01-09 22:05:45,043] INFO: Resuming iteration #2.

[2025-01-09 22:05:45,043] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:45,043] INFO: Processing task `gradient`
[2025-01-09 22:05:45,236] INFO: Submitting job array with 5 jobs ...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...
Uploading 1 files...

[2025-01-09 22:05:45,296] INFO: Launched adjoint simulations for 5 events. Please check again to see if they are finished.
[2025-01-09 22:05:45,297] INFO: Some tasks of iteration #2 are still running. Please check again later.
[2025-01-09 22:05:55,299] INFO: Processing task `gradient`
[2025-01-09 22:05:55,460] INFO: 5 events have already been submitted. They will not be submitted again.
[2025-01-09 22:05:55,884] INFO: 
Iteration 2: Number of events: 5	 chi = 0.005660667615804367	 ||g|| = 0.006938502336044995
pred = -0.0009140965914723943	ared = -0.0017791121663995081	norm_update = 0.22453958643463912	tr_radius = 1.8778823250156327
[2025-01-09 22:05:55,901] INFO: 1 new tasks have been issued.
[2025-01-09 22:05:55,901] INFO: Processing task `preconditioner`

[2025-01-09 22:05:55,997] INFO: Some tasks of iteration #2 are still running. Please check again later.
[2025-01-09 22:06:06,032] INFO: Processing task `preconditioner`
[2025-01-09 22:06:06,181] INFO: 1 new tasks have been issued.
[2025-01-09 22:06:06,181] INFO: Processing task `misfit`
[2025-01-09 22:06:06,227] INFO: Submitting job array with 5 jobs ...

[2025-01-09 22:06:06,530] INFO: Launched simulations for 5 events. Please check again to see if they are finished.
[2025-01-09 22:06:06,531] INFO: Some tasks of iteration #2 are still running. Please check again later.
[2025-01-09 22:06:16,575] INFO: Processing task `misfit`
[2025-01-09 22:06:19,130] INFO: 
old misfit control group: 0.005660667615804366
new misfit control group: 0.0037546189070832995
predicted reduction control group: -0.0013180566575329556
actual reduction control group: -0.0019060487087210667
5 out of 5 event(s) improved the misfit.
[2025-01-09 22:06:19,130] INFO: 
Model update accepted.
[2025-01-09 22:06:19,131] INFO: 1 new tasks have been issued.
[2025-01-09 22:06:19,131] INFO: Processing task `finalize_iteration`
[2025-01-09 22:06:19,366] INFO: Succesfully completed iteration #2.
[2025-01-09 22:06:19,369] INFO: Adding new iteration #3.
[2025-01-09 22:06:19,414] INFO: ... searching for obsolete files in project/INVERSIONS/my_inversion/00000
[2025-01-09 22:06:19,417] INFO: ... searching for obsolete files in project/INVERSIONS/my_inversion/00001
[2025-01-09 22:06:19,487] INFO: ... searching for obsolete files in project/INVERSIONS/my_inversion/00002
[2025-01-09 22:06:19,618] INFO: Freed up 6.0 MB of space.
Done!
Did we converge towards the ground truth?
p.viz.nb.inversion(inverse_problem_configuration="my_inversion")
Well, at are getting closer and the heterogeneities start to form. You can, of course, continue with more iterations or play around with the settings.
That's the end of this tutorial. We ran a few iterations of full-waveform inversion in a typical aperture for ultrasound screenings. Note that although the problem size is small and we did not apply any sophisticated inversion technique, we were able to create a suitable initial model and to perform a few model update with just a few lines of code. Nothing would change really if we applied this at a larger scale. By changing the site name to a remote site with more compute capabilities, we could easily scale up the problem.
PAGE CONTENTS