Memory optimization and optional parallelization for GWR/MGWR #52

Ziqi-Li · 2019-02-25T23:02:42Z

Hi, this PR is mainly to solve the memory issue with GWR and MGWR. It may seem a lot of changes. I tried my best to conform with what we have right now. All previous tests are passed. I also added several new test cases. Thanks!

Major changes:

In Kernel class, we used to compute entire spatial weights for all locations. This will need to store W (n by n) in memory. The update will be computing spatial weights on the fly when doing each local regression at location i, which results in not storing entire spatial W matrix. This allows GWR fitting to be applied to large dataset (n >20k).
Add multiprocessing as option for GWR/MGWR calibration. A multiprocessing.Pool object can be passed to Sel_BW.search(pool=pool) and GWR.fit() Example notebook is added.
Change MGWR inference computation from search.multi_bw() to MGWR.fit(). Also, add a new method for computing MGWR inference in chunks by introducing a n_chunks argument in MGWR.fit(n_chunks). e.g. when n_chunks=2 (n_chunks=k), the overall memory usage is reduced by a factor of 2 (k). This allows MGWR fitting to be applied to relatively large dataset (10k ~ 40k) within a reasonable time. The effectiveness in reducing memory by increasing n_chunks can be found here.
Add hat_matrix=False as default option for GWR() and MGWR(). Inference statistics are computed on the fly in each local regression. If entire hat matrix is needed for some reasons, one can specify hat_matrix=True, and then hat matrix can be obtained by GWRResults\MGWRResults.S.

Minor changes:
Bug fixes:

offset and spherical parameters in Set_BW are not passed to gwr_func, thus not in effect. Fixed.
Add several test cases of Sel_BW w/wo offset and w/wo spherical.

Enhancement:

Add adj_R2 for gaussian and D2 and adj_D2 (%of deviance explained) for Binomial and Poisson.
Add tests for adj_R2, D2 and validated against gwr4.

Ziqi-Li · 2019-03-19T03:25:42Z

Reviewed with @TaylorOshan and agreed on merging.

Ziqi-Li added 14 commits February 21, 2019 13:47

optim

7459714

update test file

2383db0

update test file

cc1ba76

update test file

645b350

update test

0f80a93

clean

070aa37

clean

2784d6f

add tests

fec47ad

update notebook and tests

2b9115a

memory optimization

da4f601

mem opt

8d03b33

mem opt

b18b8a9

update parallel notebook

d060ca5

update mgwr.fit

b916c2b

Ziqi-Li requested a review from ljwolf February 25, 2019 23:02

memory optimization

5fc5dae

Ziqi-Li marked this pull request as ready for review February 25, 2019 23:03

Ziqi-Li added 7 commits February 28, 2019 12:17

add pool to MGWR.fit

dcc90a0

add pool to MGWR.fit

12f65e0

add pool to MGWR.fit

b27794b

add pool to MGWR.fit

794c20d

add test for multiprocessing

07172ff

add verbose to sel_bw

6de2e27

conform to pep8

14b7056

Ziqi-Li changed the title ~~Memory optimization for GWR/MGWR~~ Memory optimization and Parallelization for GWR/MGWR Mar 19, 2019

Ziqi-Li changed the title ~~Memory optimization and Parallelization for GWR/MGWR~~ Memory optimization and parallelization for GWR/MGWR Mar 19, 2019

Ziqi-Li changed the title ~~Memory optimization and parallelization for GWR/MGWR~~ Memory optimization and optional parallelization for GWR/MGWR Mar 19, 2019

Ziqi-Li merged commit eae8ac3 into pysal:master Mar 19, 2019

Ziqi-Li mentioned this pull request Mar 19, 2019

Distance matrix calculation is not vectorized for lat, lon (spherical) coordinates #49

Closed

Ziqi-Li mentioned this pull request Jul 15, 2019

Large-scale data cause the server down #66

Closed

Ziqi-Li deleted the optim branch December 31, 2019 17:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Memory optimization and optional parallelization for GWR/MGWR #52

Memory optimization and optional parallelization for GWR/MGWR #52

Uh oh!

Ziqi-Li commented Feb 25, 2019

Uh oh!

Ziqi-Li commented Mar 19, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Memory optimization and optional parallelization for GWR/MGWR #52

Memory optimization and optional parallelization for GWR/MGWR #52

Uh oh!

Conversation

Ziqi-Li commented Feb 25, 2019

Uh oh!

Ziqi-Li commented Mar 19, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant