首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
flowAMOEBA: Identifying Regions of Anomalous Spatial Interactions   总被引:1,自引:0,他引:1  
This study aims at developing a data‐driven and bottom‐up spatial statistic method for identifying regions of anomalous spatial interactions (clusters of extremely high‐ or low‐value spatial flows), based on which it creates a spatial flow weights matrix. The method, dubbed flowAMOEBA, upgrades a multidirectional optimum ecotope‐based algorithm (AMOEBA) from areal data to spatial flow data through a proper spatial flow neighborhood definition. The method has the potential to dramatically change the way we study spatial interactions. First, it breaks the convention that spatial interaction data are always collected and modeled between spatial entities of the same granularity, as it delineates the OD region of anomalous spatial interactions, regardless of the size, shape, scale, or administrative level. Second, the method creates an empirical spatial flow weights matrix that can handle network autocorrelation embedded in spatial interaction modeling, thus improving related policy‐making or problem‐solving strategies. flowAMOEBA is tested and demonstrated on a synthetic data set as well as a county‐to‐county migration data set.  相似文献   

2.
Constructing the Spatial Weights Matrix Using a Local Statistic   总被引:3,自引:0,他引:3  
Spatial weights matrices are necessary elements in most regression models where a representation of spatial structure is needed. We construct a spatial weights matrix, W , based on the principle that spatial structure should be considered in a two‐part framework, those units that evoke a distance effect, and those that do not. Our two‐variable local statistics model (LSM) is based on the Gi* local statistic. The local statistic concept depends on the designation of a critical distance, dc, defined as the distance beyond which no discernible increase in clustering of high or low values exists. In a series of simulation experiments LSM is compared to well‐known spatial weights matrix specifications—two different contiguity configurations, three different inverse distance formulations, and three semi‐variance models. The simulation experiments are carried out on a random spatial pattern and two types of spatial clustering patterns. The LSM performed best according to the Akaike Information Criterion, a spatial autoregressive coefficient evaluation, and Moran's I tests on residuals. The flexibility inherent in the LSM allows for its favorable performance when compared to the rigidity of the global models.  相似文献   

3.
The statistics Gi(d) and Gi*(d), introduced in Getis and Ord (1992) for the study of local pattern in spatial data, are extended and their properties further explored. In particular, nonbinary weights are allowed and the statistics are related to Moran's autocorrelation statistic, I. The correlations between nearby values of the statistics are derived and verified by simulation. A Bonferroni criterion is used to approximate significance levels when testing extreme values from the set of statistics. An example of the use of the statistics is given using spatial-temporal data on the AIDS epidemic centering on San Francisco. Results indicate that in recent years the disease is intensifying in the counties surrounding the city.  相似文献   

4.
One approach to dealing with spatial autocorrelation in regression analysis involves the filtering of variables in order to separate spatial effects from the variables’ total effects. In this paper we compare two filtering approaches, both of which allow spatial statistical analysts to use conventional linear regression models. Getis’ filtering approach is based on the autocorrelation observed with the use of the Gi local statistic. Griffith's approach uses an eigenfunction decomposition based on the geographic connectivity matrix used to compute a Moran's I statistic. Economic data are used to compare the workings of the two approaches. A final comparison with an autoregressive model strengthens the conclusion that both techniques are effective filtering devices, and that they yield similar regression models. We do note, however, that each technique should be used in its appropriate context.  相似文献   

5.
Spatial Entropy     
A major problem in information theory concerns the derivation of a continuous measure of entropy from the discrete measure. Many analysts have shown that Shannon's treatment of this problem is incomplete, but few have gone on to rework his analysis. In this paper, it is suggested that a new measure of discrete entropy which incorporates interval size explicitly is required; such a measure is fundamental to geography and this statistic has been called spatial entropy. The use of the measure is first illustrated by application to one-and two-dimensional aggregation problems, and then the implications of this statistic for Wilson's entropy-maximizing method are traced. Theil's aggregation statistic is reinterpreted in spatial terms, and finally, some heuristics are suggested for the design of real and idealized spatial systems in which entropy is at a maximum.  相似文献   

6.
A programming approach is presented for identifying the form of the weights matrix W which either minimizes or maximizes the value of Moran's spatial autocorrelation statistic for a specified vector of data values. Both nonlinear and linear programming solutions are presented. The former are necessary when the sum of the links in W is unspecified while the latter can be used if this sum is fixed. The approach is illustrated using data examined in previous studies for two variables measured for the counties of Eire. While programming solutions involving different sets of constraints are derived, all yield solutions in which the number of nonzero elements in W is considerably smaller than that in W defined using the contiguity relationships between the counties. In graph theory terms, all of the Ws derived define multicomponent graphs. Other characteristics of the derived Ws are also presented.  相似文献   

7.
8.
A test statistic for the detection of spatial clusters is developed by generalizing the common chi-square goodness-of-fit test. The paper includes a discussion of the relationship between the statistic and other associated statistics, and provides an analysis of both its null distribution and power. The paper concludes with the development of a local version of the statistic and an application to leukemia clustering in central New York.  相似文献   

9.
A fundamental concern of spatial analysts is to find patterns in spatial data that lead to the identification of spatial autocorrelation or association. Further, they seek to identify peculiarities in the data set that signify that something out of the ordinary has occurred in one or more regions. In this paper we provide a statistic that tests for local spatial autocorrelation in the presence of the global autocorrelation that is characteristic of heterogeneous spatial data. After identifying the structure of global autocorrelation, we introduce a new measure that may be used to test for local structure. This new statistic Oi is asymptotically normally distributed and allows for straightforward tests of hypotheses. We provide several numerical examples that illustrate the performance of this statistic and compare it with another measure that does not account for global structure.  相似文献   

10.
A test statistic for the detection of spatial clusters is developed by generalizing the common chi-square goodness-of-fit test. The paper includes a discussion of the relationship between the statistic and other associated statistics, and provides an analysis of both its null distribution and power. The paper concludes with the development of a local version of the statistic and an application to leukemia clustering in central New York.  相似文献   

11.
This study assesses the overall spatial variations and neighbourhood‐level “hot spots” of low birth weight and preterm birth incidence within three public health units in Ontario, Canada. The analysis uses a stepwise approach of intra‐class correlation analysis, a spatial scan statistic, and multilevel spatial modeling. Results show that neighbourhood level variation accounts for only 2–3 percent of the total variation of adverse birth outcomes in the study area. However, strong spatial autocorrelation is observed at the neighbourhood level, and spatial clusters of relatively high adverse birth outcome rates exist in areas that are associated with environmental risks, including pollution sources and proximity to highways. Thus, although estimated neighbourhood impacts on adverse birth outcomes are small compared with those of individual‐level risks, local high potential environmental risk areas are identifiable. Environmental surveillance and spatial statistical analysis should be conducted regularly by local health authorities to identify and monitor the impact of environmental changes on health in general and on birth outcomes in particular. Specific community‐oriented health interventions may be required to reduce observed local health impacts.  相似文献   

12.
Assessing the significance of multiple and dependent comparisons is an important, and often ignored, issue that becomes more critical as the size of data sets increases. If not accounted for, false-positive differences are very likely to be identified. The need to address this issue has led to the development of a myriad of procedures to account for multiple testing. The simplest and most widely used technique is the Bonferroni method, which controls the probability that a true null hypothesis is incorrectly rejected. However, it is a very conservative procedure. As a result, the larger the data set the greater the chances that truly significant differences will be missed. In 1995, a new criterion, the false discovery rate (FDR), was proposed to control the proportion of false declarations of significance among those individual deviations from null hypotheses considered to be significant. It is more powerful than all previously proposed methods. Multiple and dependent comparisons are also fundamental in spatial analysis. As the number of locations increases, assessing the significance of local statistics of spatial association becomes a complex matter. In this article we use empirical and simulated data to evaluate the use of the FDR approach in appraising the occurrence of clusters detected by local indicators of spatial association. Results show a significant gain in identification of meaningful clusters when controlling the FDR, in comparison to more conservative approaches. When no control is adopted, false clusters are likely to be identified. If a conservative approach is used, clusters are only partially identified and true clusters are largely missed. In contrast, when the FDR approach is adopted, clusters are fully identified. Incorporating a correction for spatial dependence to conservative methods improves the results, but not enough to match those obtained by the FDR approach.  相似文献   

13.
Most test statistics for detecting spatial clustering cannot distinguish between low-value spatial clustering and high-value spatial clustering, and none is designed to explicitly detect high-value clustering, low-value clustering, or both. To fill this void in practice, we introduce an adjustment procedure that can supplement common two-sided spatial clustering tests so that a one-sided conclusion can be reached. The procedure is applied to Moran's I and Tango's C G in both simulated and real-world spatial patterns. The results show that the adjustment procedure can account for the influence of low-value clusters on high-value clustering and vice versa. The procedure has little effect on the original global testing methods when there is no clustering. When there is a clustering tendency, the procedure can unambiguously distinguish the existence of high-value clusters or low-value clusters or both.  相似文献   

14.
This article considers the most important aspects of model uncertainty for spatial regression models, namely, the appropriate spatial weight matrix to be employed and the appropriate explanatory variables. We focus on the spatial Durbin model (SDM) specification in this study that nests most models used in the regional growth literature, and develop a simple Bayesian model‐averaging approach that provides a unified and formal treatment of these aspects of model uncertainty for SDM growth models. The approach expands on previous work by reducing the computational costs through the use of Bayesian information criterion model weights and a matrix exponential specification of the SDM model. The spatial Durbin matrix exponential model has theoretical and computational advantages over the spatial autoregressive specification due to the ease of inversion, differentiation, and integration of the matrix exponential. In particular, the matrix exponential has a simple matrix determinant that vanishes for the case of a spatial weight matrix with a trace of zero. This allows for a larger domain of spatial growth regression models to be analyzed with this approach, including models based on different classes of spatial weight matrices. The working of the approach is illustrated for the case of 32 potential determinants and three classes of spatial weight matrices (contiguity‐based, k‐nearest neighbor, and distance‐based spatial weight matrices), using a data set of income per capita growth for 273 European regions.  相似文献   

15.
The ability to detect anomalies such as spatial clustering in data sets plays an important role in spatial data analysis, leading to interest in test statistics identifying patterns exhibiting significant levels of clustering. Toward this end, Tango (1995) proposed a statistic (and its associated distribution under a null hypothesis of no clustering) assessing overall patterns of spatial clustering in a set of observed regional counts. Rogerson (1999) observed that Tango's index may be decomposed into the summation of two distinct statistics, the first mirroring standard tests of goodness-of-fit (GOF), and the second an index of spatial association (SA) similar to Moran's I . In this article, we investigate the effectiveness of Rogerson's expression of Tango's statistic in separating GOF from SA in data sets containing clusters. We simulate data under the null hypothesis of no clustering as well as two alternative hypotheses. The first alternative hypothesis induces a poor fit from the null hypothesis while maintaining independent observations and the second alternative hypothesis induces spatial dependence while maintaining fit. Using Rogerson's decomposition and leukemia incidence data from upstate New York, we show graphically that one is unable to statistically distinguish poor fit from autocorrelation.  相似文献   

16.
Based on a large number of Monte Carlo simulation experiments on a regular lattice, we compare the properties of Moran's I and Lagrange multiplier tests for spatial dependence, that is, for both spatial error autocorrelation and for a spatially lagged dependent variable. We consider both bias and power of the tests for six sample sizes, ranging from twenty-five to 225 observations, for different structures of the spatial weights matrix, for several underlying error distributions, for misspecified weights matrices, and for the situation where boundary effects are present. The results provide an indication of the sample sizes for which the asymptotic properties of the tests can be considered to hold. They also illustrate the power of the Lagrange multiplier tests to distinguish between substantive spatial dependence (spatial lag) and spatial dependence as a nuisance (error autocorrelation).  相似文献   

17.
The rank adjacency statistic D is a statistical method for assessing spatial autocorrelation or clustering of geographical data. It was originally proposed for summarizing the geographical patterns of cancer data in Scotland (IARC 1985). In this paper, we investigate the power of the rank adjacency statistic to detect spatial clustering when a small number of regions is involved. The investigations were carried out using Monte Carlo simulations, which involved generating patterned/clustered values and computing the power with which the D statistic would detect it. To investigate the effects of region shapes, structure of the regions, and definition of weights, simulations were carried out using two different region shapes, binary and nonhinary weights, and three different lattice structures. The results indicate that in the typical example of considering Canadian total mortality at the electoral district level, the D statistic had adequate power to detect general spatial autocorrelation in twenty‐five or more regions. There was an inverse relationship between power and the level of connectedness of the regions, which depends on the weighting function, shape, and arrangement of the regions. The power of the D statistic was also found to compare favorably with that of Moran's I statistic.  相似文献   

18.
Spatial autocorrelation (SA) is regarded as an important dimension of spatial pattern. SA measures usually consist of two components: measuring the similarity of attribute values and defining the spatial relationships among observations. The latter component is often represented by a spatial weights matrix that predefines spatial relationship between observations in most measures. Therefore, SA measures, in essence, are measures of attribute similarity, conditioned by spatial relationship. Another dimension of spatial pattern can be explored by controlling observations to be compared based upon the degree of attribute similarity. The resulting measures are spatial proximity measures of observations, meeting predefined attribute similarity criteria. Proposed measures reflect degrees of clustering or dispersion for observations meeting certain levels of attribute similarity. An existing spatial autocorrelation framework is expanded to a general framework to evaluate spatial patterns and can accommodate the proposed approach measuring proximity. Analogous to the concept of variogram, clustergram is proposed to show the levels of spatial clustering over a range of attribute similarity, or attribute lags. Specific measures based on the proposed approach are formulated and applied to a hypothetical landscape and an empirical example, showing that these new measures capture spatial pattern information not reflected by traditional spatial autocorrelation measures.  相似文献   

19.
Regionalization or districting problems commonly require each individual spatial unit to participate exclusively in a single region or district. Although this assumption is appropriate for some regionalization problems, it is less realistic for delineating functional clusters, such as metropolitan areas and trade areas where a region does not necessarily have exclusive coverage with other regions. This paper develops a spatial optimization model for detecting functional spatial clusters, named the p‐functional clusters location problem (p‐FCLP), which has been developed based on the Covering Location Problem. By relaxing the complete and exhaustive assignment requirement, a functional cluster is delineated with the selective spatial units that have substantial spatial interaction. This model is demonstrated with applications for a functional regionalization problem using three journey‐to‐work flow datasets: (1) among the 46 counties in South Carolina, (2) the counties in the East North Central division of the US Census, and (3) all counties in the US. The computational efficiency of p‐FCLP is compared with other regionalization problems. The computational results show that detecting functional spatial clusters with contiguity constraints effectively solves problems with optimality in a mixed integer programming (MIP) approach, suggesting the ability to solve large instance applications of regionalization problems.  相似文献   

20.
产业集群现已成为发展区域经济和增强区域竞争力最有效的途径之一。然而,如何识别和选择产业集群则是困扰决策者和研究者的主要技术问题。本文在分析国内外产业集群相关研究方法的基础上,以陕西省为例,综合应用LQ法和基于投入产出表的主成分分析法,尝试性地进行了区域产业集群的识别与选择研究,结论显示LQ法和基于投入产出表的主成分分析法可以很好地体现产业集群的"空间联系"和"功能联系"。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号