1 Cartographic Visualization in Support of Dialectology Pius Sibler, Robert Weibel, Elvira Glaser & Gabriela Bart ABSTRACT: Using data from the Syntactic Atlas of Swiss German Dialects (SADS), several methods for the cartographic visualization of linguistic data are proposed, demonstrated, and evaluated against the requirements of linguistic research. After reviewing the challenges of linguistic data visualization, point symbol maps are introduced as a baseline. We then present alternative visualization methods that present linguistic data in new ways. The first uses Voronoi polygons about the data points to color in the dominant variant per location. The second uses kernel density estimation (KDE) to interpolate intensity values of all variants and thus infer and display the dominant variant per location (incl. at missing value locations). The KDE-based method helps to better see trends in the data and also automatically infer isoglosses. The third new technique uses 3-D visualization to support the exploration of spatial trends, as well as cooccurring variants. As a fourth alternative, measures and methods from geostatistics are used for the visualization of specific, global and local, variations and patterns. KEYWORDS: Linguistic area formation, dialectology, cartographic visualization techniques, kernel density estimation, geostatistics Introduction In an attempt to document linguistic variation across geographic space and study the formation of linguistic areas (or, as geographers prefer to say, regions), such as those formed by different dialects of a language, linguists have been, and still are, routinely collecting linguistic data. These observation data provide evidence of the language spoken at selected locations, or sites. Linguistic data collections are then normally published in the form of atlases, such as LAMSAS (Linguistic Atlas of the Middle and South Atlantic States) in the USA (McDavid et al., 1980). Visualizing linguistic observations, however, poses formidable challenges. First, it is common for different variants of a linguistic feature to co-occur at the same geographic location. For instance, the although variant seems to be preferred over though in the Northeast of the USA (Grieve et al., 2011). However, in most places of the USA, both variants are possible. Second, data are typically only collected at relatively few selected locations, and these sites represented by points that are thought to act en lieu of entire municipalities (or other administrative areal units), with no observations available for most places. Third, from linguistic point data it is hard to infer clear-cut boundaries separating language areas (called isoglosses in linguistics, such as the boundary between though / although occurrences in the USA), and there is also a certain arbitrariness to the definition of isoglosses. In the terminology of Smith and Varzi (2000) they thus clearly belong to the fiat type boundaries. Furthermore, isoglosses of different linguistic features very rarely overlap (though traditional
2 dialectological theory assumes the existence of bundles of isoglosses; Chambers and Trudgill, 1998). Thus, in an attempt of staying on the safe side, language atlases often rely on simple maps of the raw point observations, possibly combined with selected isoglosses that have been visually ( manually ) inferred. This paper reviews several alternative methods for cartographic visualization of linguistic observations that have originally been collected at point locations. The techniques either use area-class maps, 3-D maps, or geostatistical mapping. Some of these have already been proposed elsewhere for dialectological mapping, while others are known in other fields but have not yet been applied to linguistic data. The main contribution of this paper is to bring these visualization techniques together and assess their utility in the context of dialectology. Related Work While many language atlases traditionally present dialect observations using point symbol maps (Fig. 1), a range of other cartographic methods has been used in linguistic research, though less widespread. Pi (2006) points to weaknesses of isoglosses and proposes an alternative technique called isograph. Rather than drawing boundaries, the isograph links areal units with the smallest percentage difference when compared to adjacent observations (e.g. using the percentages for though vs. although from the above example). Kretzschmar (2003) uses a similar technique, though using join-counts between neighboring sites over a Delaunay triangulation. The handbook edited by Lameli et al. (2010), gives a good overview of the various map types and visualization techniques used in concurrent linguistics and dialectology. In particular researchers working on dialectometry, that is, the quantitative analysis and mapping of dialect features, use visualization techniques that go beyond point symbol maps. Goebl (2010) (incl. also in his earlier work) advocates the use of honeycomb maps and beam maps, that is, Voronoi diagrams and Delaunay triangulations, respectively, constructed about the linguistic point observations. Goebl s article also includes a map from the very early days of dialectology by Haag (1898), which depicts isoglosses and isogloss bundles in a Voronoi-like fashion. Voronoi-based maps are also used by dialectometrist Nerbonne (2009, 2010). Interestingly, however, while in both Goebl s and Nerbonne s work the Voronoi diagram is used as a spatial principle to structure geographic space and extend the point observations to form area-class maps, the actual attributes mapped in this Voronoi structure are not generated using geospatial principles such as spatial interpolation. Instead, both researchers focus on linguistic distances (as a measure of linguistic similarity) and multivariate, non-spatial methods to infer the mapped attributes. Distance measures such as the Levenshtein distance are used to express the similarity of linguistic features, and form the input to multivariate statistical techniques such as cluster analysis and multidimensional scaling (MDS), grouping sites according to aggregate linguistic similarity, hence yielding potential linguistic areas when re-projected to geographic space (i.e. mapped to the Voronoi diagram). The choice of multivariate methods is an effect of the interest of these researchers in mapping aggregate linguistic variation of many linguistic features (Nerbonne, 2009, 2010; Goebl, 2010).
3 Methods of spatial interpolation have been used by researchers in dialectometry with a focus on variations in single (or few) linguistic features. Rumpf et al. (2009) proposed the use of kernel density estimation (KDE) to interpolate the intensities of variants of a single linguistic feature, and mapping the intensities to a Voronoi diagram to yield an area-class map. Note that this method is one of the mapping techniques that will be used further down in this paper. Rumpf et al. (2010) extended their original work to explore geographical similarities between many individual area-class maps. From each individual area-class map, additional information regarding its structural composition in geographic space is extracted. Cluster analysis is then employed to obtain groupings of structurally similar maps. Wattel and van Reenen (2010) use an interpolation method called splashing on presence/absence data of linguistic variants, which is partially similar to the technique by Rumpf et al. (2009), but is used to interpolate a dense grid (rather than smoothed intensities at observation sites) to show transitions between variants. It is interesting to note that while researchers in linguistics, in particular those in dialectometry, have used various techniques that represent core methods of GIScience, instances of the analysis and mapping of linguistic data in GIScience are hard to find. A notable exception is the paper by Lee and Kretzschmar (2003) (where Kretzschmar is a linguist) which introduces the point pattern analysis, in particular join-count statistics, used in Kretzschmar (2003). Another instance is the paper by Hoch and Hayes (2010), which advocates the use of GIS in linguistics and provides a review of related research in linguistics, and proposes the use of several GIS techniques (incl. interpolation by kriging and point pattern statistics such as Ripley s K), but does not really present empirical evidence for the proposals made. Requirements In our paper, we will focus on methods to map variants of single linguistics features, such as the though/although pair. Furthermore, we will use syntactic data as a basis (i.e. observations about the variation of grammatical features) rather than lexical data (i.e. data about variations in the words used). Syntactic features usually show less variation (i.e. a smaller number of variants) than lexical features. Following are the requirements for mapping methods (cf. also Bucheli Berger, 2008): 1) Must be capable of displaying variation in a single linguistic feature (rather than the co-variation of multiple features). 2) Should be capable of displaying co-occurrences of multiple variants of the same feature at the same geographic location (i.e. co-location). 3) Should be capable of filling in missing values (which often occur, since data are usually only collected at few locations, e.g. a subset of municipalities in a study area). That is, should be able to extra-/interpolate values. 4) Should support the delineation of linguistic boundaries as isoglosses.
4 5) Should be capable of displaying the gradients of spatial linguistic variation, and infer linguistic boundaries between variants. 6) Should depict the global spatial patterns of linguistic variation as well as the local patterns. These patterns should be easily perceivable, not necessitating cumbersome study of the map and legend. 7) Should allow displaying different variants with different visual weight (e.g. use light map symbols for a commonly occurring variant, and heavy, eye-catching symbols for infrequently occurring variants to better highlight them in the map). 8) If the number of responses per observation site is variable (e.g. 5 responses for one site, and 20 for another), it should be possible to display these quantities, in order to give an impression of the uncertainty involved in the responses (1 out of 5 is less reliable than 4 out of 20, though the percentage is the same). 9) Should be visually attractive. 10) Should include a base map (e.g. political boundaries, hydrography) that ensures spatial reference. 11) Should provide quantitative measures of spatial linguistic variation that may be statistically tested. Presumably, it will be hard to satisfy all the above requirements equally well with one particular map design, as some of them represent trade-offs. We will now move to present different map design that might be used to meet the above requirements. We will start with the map type that may serve as a baseline point symbol maps and then present four alternative designs. Background and Baseline The review of related work, as well as the definition of requirements have been approached from a general perspective of mapping linguistic data. However, for the purposes of this paper, we will focus on the study of morphosyntax in dialectology, and we will do so using the example of the Syntactic Atlas of Swiss German Dialects (SADS). The Syntactic Atlas of Swiss German Dialects (SADS) The SADS project was initiated in the year 2000 to map and study syntactical phenomena of Swiss German dialects (Bucheli and Glaser, 2002). Close to 3,200 informants participated in the survey. They live in 383 municipalities, which represent approx. 25% of the German speaking municipalities in Switzerland. In other words, for about every fourth municipality, observation data are available, while for the other places no direct testimonies exist. Per observation site, between 3 and 26 informants were involved, with a median value of 5 to 6 informants per municipality. More details about the design of the survey and questionnaires that generated the database for the SADS can be found in Bucheli and Glaser (2002).
5 Use Case: Infinitival Complementizer For the purposes of this paper, we will use for the most part a syntactical construct that is called infinitival complementizer. For instance, in the English sentence I don t have enough change in order to buy a ticket the infinitival complementizer is in order to (or simply to ). It introduces a so-called purposive infinitival clause (since the clause expresses a purpose). In the Standard German equivalent, the infinitival complementizer is um zu ( Ich habe zu wenig Kleingeld, um ein Billet zu lösen ). In Swiss German dialects, two variants of this complementizer exist: zum and für. (As an aside, the noun Billet is the Swiss variant for English ticket (in proper Standard German this would be Fahrkarte. Billet was used since the questionnaire was used in Switzerland.) The two variants of this syntactic feature in Swiss German dialects show an East-West distribution, with the für variant predominantly occurring in the West, and the zum variant in the East. Due to its simple distribution pattern, we use this syntactic feature as our standard use case throughout the paper. Furthermore, and very importantly, linguistic research has also developed hypotheses about the variation of this feature: Seiler (2005), among others, describes the variation of the two variants as two inclined planes, with strike in E-W direction, and dip in easterly direction for für, and westerly direction for zum. Thus, using this use case we cannot only explore the utility of different map designs, but also apply further geostatistical analysis to test the hypotheses that linguistic research has generated. For this paper, we will focus on the inclined plane hypothesis alone. The Baseline: Point Symbol Maps Over the course of the SADS project experiments were made with several different map designs, including point symbol maps in several variations as well as area-class maps (in particular choropleth maps). Following a testing and evaluation phase, the decision was made to use point symbol maps for the production of the atlas (Bucheli Berger, 2008; Bucheli Berger et al., forthcoming). After the production phase of the atlas had started, and after the decision had been made for point symbol maps, a separate project was initiated (Sibler, 2011), which had as its objective to explore different alternative visualization and analysis techniques, some of which will be presented in the following section. This project is not in competition with the SADS atlas production process, and its results will also not directly influence the initial edition of the SADS; bear in mind that the production of such an atlas that comprises many maps, including associated commentaries, is a laborious and complex undertaking. Some of the results, however, might be included in later extensions of the atlas. We thus use point symbol maps as a baseline to compare against. Figures 1 and 2 show two examples of point symbol maps used in the SADS. Both use base maps that depict the topographic relief, major hydrography, and political boundaries (Swiss cantons) to provide a spatial reference. Both also differentiate between single occurrences of a variant in a particular location ( Einzelnennung ) and multiple occurrences ( Mehrere Nennungen ). Figure 1 depicts both variants of our infinitival complementizer example (für and zum), and is restricted to b/w symbology, while Figure 2 is restricted to a single complementizer (für), but it uses color and can thus also show the degree of dominance ( mehrheitlich ).
6 Proceedings - AutoCarto Columbus, Ohio, USA - September 16-18, 2012 Figure 1: Black-and-white point symbol map in SADS. See text for details. Figure 2: Color point symbol map in SADS. See text for details.
7 In both Figure 1 and Figure 2, it is possible to see both the global pattern (an E-W trend) as well as local variations and concentrations or outliers. Furthermore, the data that was collected in the linguistic survey is displayed directly, without further modification or processing by some analytical procedure; the map reader is not influenced by the effects of processing imposed by some quantitative method. On the other hand, point symbol maps rely on the visual perception and cognition, intuition, and linguistic expertise of the map reader to infer meta structures implicitly contained in the point data. Boundaries between variants, or language areas, must be inferred visually. To a great extent, this will be alleviated in the published SADS version by the fact that the maps are accompanied by commentaries that offer a description and interpretation of the linguistic phenomena displayed. It is, however, also possible to draw isoglosses on such maps, in order to visually clarify an interpretation or linguistic hypothesis. An example of this is shown with the isoglosses of Figure 3, which have been overlaid on a map from another language atlas, the dialect atlas of Swiss German (SDS; Hotzenköcherle, ). Figure 3: Section from an SDS map with isoglosses overlaid (base map from Hotzenköcherle et al. 1975, Vol. III: 261). Alternative Visualization Techniques We will now present four alternative mapping methods. They have been developed and tested in the course of the project by Sibler (2011). For an evaluation of different types of maps for the purposes of a language atlas, see also Bucheli Berger et al. (forthcoming). Area-class: Voronoi Polygons When the aim is to generate area-based maps, two questions need to be answered: What should be the areal unit used? And, which attribute is mapped? To answer the first question, one might simply choose the administrative areal units that are associated with the linguistic observations (e.g. zipcode areas, municipalities). However, as mentioned
8 above, linguistic surveys rarely ever cover a study area exhaustively. The example of the SADS, where only a quarter of municipalities has been surveyed, is not uncommon; in fact, it even represents an example of dense data coverage. Thus, a map based on administrative units would be quite perforated, with most units depicting missing values. The Voronoi diagram offers a way out of this problem. Voronoi (or Thiessen) polygons have advantageous properties that are well-known in GIScience: they generate by definition a complete subdivision of the plane (which removes the above problems of holes), and they create a proximal map about the points used to generate the cells of the Voronoi diagram. In our case, we choose the centroids of the SADS sites (i.e. the municipalities that were surveyed for SADS), as shown in Figure 4. As an attribute to be mapped, we choose the intensity of the dominant variant. For each site the percentage of occurrence of each variant is calculated. Next, for each site the dominant variant (i.e. the one that yields the highest percentage) is chosen and mapped, while all others are suppressed. Hence, in Figure 4 we see a similar picture as in Figure 1, but now the observations have been extended to areas. Like Figure 1, which shows an overall trend, but which also has outliers, Figure 4 also looks a bit patchy, though the trend between the two variants of the infinitival complementizer is still visible. This type of map follows the choropleth model: a quantitative attribute is mapped to areal units, using lightness variations. Note also that we could have mapped other attributes that can be extracted from the SADS database, such as number of respondents per site (and thus an indicator of reliability). However, with this map type only a single attribute can be displayed at one time. Figure 4: Voronoi polygons for SADS sites. Cells colored according to dominant variant of the infinitival complementizer variants für vs. zum (Sibler, 2011: 41). Area-class: Intensities from KDE The patchiness of maps like the one of Figure 4 may be an effect of short-range spatial variation of a linguistic feature, or it may simply be the effect of unreliable responses that create noise and uncertainty. We now want to retain the choice of areal units (Voronoi polygons for SADS sites) and the choice of mapped quantity (intensities of linguistic variants), but we want to change the way that the intensities are mapped, removing shortrange variations. Thus, we would like to replace the original intensity values of Figure 4
9 with intensities that represent an estimate of a local neighborhood. For this purpose, interpolation methods can be used, but since we are dealing with counts data, the use of a method that can deal with that type of data is warranted. Hence, we use the method by Rumpf et al. (2009), which uses kernel density estimation (KDE) to infer smooth intensity values. In the following examples, we use KDE with a bandwidth of 10 km. This value was established after calibration (details see Sibler, 2011). Figure 5 shows the working principle of the method. For each variant of a feature in our case the infinitival complementizer variants für and zum KDE-based interpolation yields a smooth intensity surface. These separate surfaces are then merged to form one layer, with only the dominant variant remaining per location. Figure 5: Working principle of KDE-interpolated intensities for dominant variants, on the example of the infinitival complementizer variants für and zum (Sibler, 2011: 29). Figure 6 shows how this method cannot only be used to estimate smoothed intensities at the original SADS locations, but also at locations where intensity values are missing (i.e. for the ¾ of municipalities with no SADS observations). The visual comparison of the two maps of Figure 6 suggests that they largely resemble each other. The map with the Voronoi polygons formed for the centroids of the Swiss municipalities is somewhat smoother, but the interpolation is apparently sufficiently robust to yield an almost equivalent result even when four times more interpolation points are used. Most importantly, however, we can now see much more clearly the East-West trend in the mapped linguistic feature, as it was described by Seiler (2005), compared to the maps of Figures 1 and 4, respectively. In Figure 1, it takes some time (and perhaps also experience) to see the trend emerge from the pattern of points symbols. In Figure 4, the effect is more clearly noticeable, due to the area-class representation. But it is only in Figure 6 where we can clearly see how the red für surface and the blue zum surface gently slope towards each other, forming high intensities at the eastern and western end, and a transition zone in between. There is only one small blue area in the region around Basel that forms an exception.
10 Figure 6: Interpolated intensities from KDE for dominant variants of infinitival complementizers für and zum, for different areal units. Left: Voronoi polygons for SADS locations. Right: Voronoi polygons for the centroids of Swiss municipalities (Sibler, 2011: 41). Figure 7 shows an example from another linguistic feature, the position of the indefinite article in conjunction with a complex adjective phrase. The example uses the sentence (in Standard German): Susi wäre eine ganz liebe Frau für Markus ( Susi would be a very nice woman for Mark ). In Swiss German, this feature yields three variants: 1) preponed, as in Standard German; postponed, as in ganz ä liebi Frau ( very a nice woman ); and doubled, as in e ganz e liebi Frau ( a very a nice woman ). Figure 7 presents the three variants for this linguistic feature in two ways: using Voronoi polygons without interpolation (as in Fig. 4), and using intensities that have been interpolated using KDE (as in Figures 5 and 6). The difference is very clearly pronounced: while in the un-interpolated Voronoi map, relatively little structure is noticeable, a pattern emerges from the interpolated intensities that forms a corridor of the doubled indefinite article variant (e ganz e liebi) from Basel in the Northwest towards the area of Grisons in the Southeast. The preponed variant has disappeared completely (note, however, that even on the simple Voronoi map it only existed in small, isolated pockets). It is interesting to note that previous dialectological research (Richner-Steiner, 2011) had provided hints for the corridor of the doubling variant between Basel and Grisons. Those hints were not strongly pronounced, though: the corridor would only become noticeable if only the places with strong preference for the doubling variant (i.e. > 75% preference) were considered. Thus, it came as a positive surprise when this corridor of the doubling variant was extracted by the KDE-based method. In this case, the dialectometrical approach helped finding a syntactic area where the doubling of the indefinite article is quite common. Conversely, the traditional point symbol maps of the SADS present a rather chaotic distribution that can be interpreted only with difficulty. While Figure 7 showed an example where KDE-based interpolation helps to reveal a pattern of spatial variation of a linguistic feature, Figure 8 shows an example where KDE interpolation conceals rather than reveals. The example is about the complementizer in comparative clauses, as in the following sentence in Standard German: Sie ist grösser als ich ( She is taller than me ). In Swiss German, this feature has four variants: als, weder, wie, wa(n). The als variant is dominating over the whole area, while the others prevail only in very restricted zones, so that KDE interpolation sweeps them away. So, if there was any variation pattern noticeable in the original data, it has now disappeared completely.
11 Figure 7: Three variants of the position of indefinite article in conjunction with an adjective. Left: Dominant variants mapped to Voronoi polygons, without interpolation. Right: Interpolated intensity values of dominant variant from KDE (Sibler, 2011: 46). Figure 8: Four variants for complementizer in comparative clauses. Left: Dominant variants mapped to Voronoi polygons, without interpolation. Right: Interpolated intensity values of dominant variant from KDE (Sibler, 2011: 44). 3-D Views Both preceding map types have the disadvantage that a single attribute (e.g. the dominant variant per location) can be displayed. Variants that are not dominant are concealed, and the interplay of multiple variants is hardly perceivable. A possible way out of this problem is to view the intensities of multiple variants in three dimensions, as it is done in Figure 9 for the well-known infinitival complementizer example. The 2-D Voronoi polygons have been assigned the elevation of the intensity value, colored in according to the usual scheme (red for the für variant, blue for zum), and projected into 3-D space. If an interactive system is used capable of handling 3-D data (this example was done in a standard commercial GIS software system) then the user can manipulate the 3-D view interactively and explore the data from changing perspective, thus getting a better idea of how the different variants interact, how steep the inclined planes (Seiler, 2005) are, etc.
12 Figure 9: Two 3-D views of infinitival complementizer variants für (red) and zum (blue) (Sibler, 2011: 51). Geostatistics to Highlight Specific Patterns The methods discussed so far all focus on cartographic visualization. However, it should be pointed out that the intensity values both without and with KDE interpolation can also be processed further, in an analytical sense. As is well known in GIScience, there are plenty of geostatistics methods available that can be applied to linguistic data. Our linguistic data are originally counts data, which limits the options for geostatistical analysis. However, if these counts are converted to intensities (see above), then many more options become available. Sibler (2011) conducted several geostatistical analyses. Among others, in order to test the hypothesis by Seiler (2005) who postulated that the two variants of the infinitival complementizer would form two opposing inclined planes, trend surface analysis was carried out. Trend surfaces of first to fourth order were fitted to the unsmoothed intensities and the residuals tested with an F-test, showing a highly significant trends (p < 0.01). This analysis also revealed that the general direction of the trend is not from West to East, but rather rotated by 45 degrees, that is SW to NE. For the same intensity data, Moran s I (to test for global autocorrelation) as well semivariogram fitting were used, with semivariograms fitted to two stripes of data, one in the direction parallel to the strike of the inclined planes (SW-NE), the other one perpendicular (NW-SE). This revealed a clear direction dependency of the semivariograms, and thus also supports the inclined plane hypothesis. Another linguistic feature that was further explored and tested was the complementizer in comparative clauses. As Figure 8 showed, the als variant is so wide-spread and dominant over the entire area that only in few cases do the other three variants become dominant. As soon as KDE interpolation is applied, the small pockets of dominance of other variants disappear completely. The question, then, is whether below this blanket of the dominant als variant the other variants show some distinct patterns. On the point symbol maps for this feature, it is very hard to extract clear patterns; only careful study give hints to that extent. On the area-class maps that focus on the display of dominant variants, it is simply impossible to see anything, due to the overwhelming dominance of als. Therefore, the intensity data were subjected to an analysis of local spatial autocorrelation using the Getis-Ord G i * statistic (Ord and Getis, 1995; Getis, 2010), which has also been used in dialectology by Grieve et al. (2011). The result can be seen in Figure 10. Very clearly, it
13 paints a much more differentiated picture than the maps of Figure 8 do. In Figure 10, we can see hot spots (clusters of highly positive Z-scores) and cold spots (clusters of highly negative Z-scores) for all variants, except for the wan variant, which shows only a hot spot in the Valais / Bernese Alps region, where it is the dominant variant. The als variant does no longer appear as overwhelmingly dominant as would seem from Figure 8. The wie variant appears with a prominent hot spot along the Northern border of the Swiss border towards Germany. These quantitatively extracted patterns support the qualitative findings of Friedli (2005), who had reached his conclusions based on the study of point symbol maps. Figure 10: Four variants for complementizer in comparative clauses, displayed with hot/cold spot maps using the Getis-Ord G i * statistic (Sibler, 2011: 76). Discussion We have presented five cartographic visualization methods: 1) point symbol maps (our baseline), 2) area-class maps based on Voronoi polygons, 3) area-class maps using intensities from KDE interpolation, 4) 3-D views, and 5) geostatistical analysis and mapping techniques. In the following discussion we will call these five methods short M1 to M5, in order to save space. Likewise, the requirements will be abridged to R1-R11. Naturally, other methods could be imagined than the five methods presented here. However, given our experience with language and dialect atlases, we argue that they consti-
14 tute a representative sample that stands for a broad range of different characteristics of visualization and GIScience methods. R1 (focus on single features): Since we focus on the study of individual language features (rather than aggregate variation; Nerbonne, 2010), all visualization and analysis methods that assume multivariate input data are automatically ruled out. Conversely, requirement R1 is met by all methods of this paper. R2 (display co-occurring variants; co-location): M1 obviously can display multiple variants at the same spot, though with increasing number of these variants, and increasing density sample sites, the map will become less and less legible, particularly with respect to R6. On the other hand, M2 and M3, with their focus on dominant variants, do not meet R2. M4 can deal with spatial co-occurrence (i.e. co-location), but similarly to M1 legibility decreases rapidly with increasing number of variants. Like all 3-D displays, M4 also suffers from visibility problems (part of the display may be hidden). Finally, M5 focuses primarily on analysis not visualization and is not capable of dealing with co-location. R3 (fill in missing values): Both M1 and M2 focus on the raw data and have no interpolation facility; they cannot infer missing values at locations where no data was collected. M3 interpolates intensity values from KDE (possibly at arbitrary locations) and thus meets R3. M4 relies on intensities generated by M3 (but renders them in 3-D) and this meets R3. And some of the M5 geostatistics methods (kriging, trend surfaces, etc could also interpolate values). R4 (support of isogloss/boundary delineation): M1 supports the delineation of language boundaries graphically. The advantage here is the map reader can visually infer boundaries but is not influenced by the interpretation of an automated, quantitative method. M2 already introduces a model (the Voronoi diagram). Thus, delineation of language boundaries is possible, but will be influenced by the Voronoi structure, which pre-supposes the position of boundaries. M3 clearly helps finding potential language boundaries (sometimes even non-obvious ones, see Figure 7), but it also imposes a computational model that is bound to have an effect on the result (cf. Fig. 8). M4 is less suited for R4, due to visual overlaps and unfamiliar perspective. M5 offers several geostatistical methods that can help in boundary delineation. R5 (gradient display and boundary inference): The requirement is an extension of R4. In M1, gradients must be perceived visually (like the gradients that feel smoother in Fig. 1 and 2 than in Fig. 3). M2, the Voronoi model assumes a discrete surface and can thus not infer gradients. M3, on the other hand, has excellent capabilities for gradient computation, and based on gradients also boundary inference. Since intensity values are the same in M4 as in M3, and in M5, both M4 and M5 also have gradient mapping and delineation capability. R6 (easy-to-read depiction of global/local patterns): As mentioned similarly for R4, M1 allows to visually explore patterns. The human visual and cognitive system is capable of seeing highly complex patterns that are hard to describe. On the other hand, visual interpretation is subjective and may not lead to the same result, if different map readers are
15 given the same task of map reading. M2 is similar regarding fulfillment of R6, though no longer with point data (which might have a perceptual and cognitive effect). M3 in general helps to detect patterns, as through smooth KDE interpolation noise in the input data is suppressed (fig. 7). But it may sometimes also plane away local patterns (cf. Fig. 8). M4 visualizes global trends very well, but local patterns may disappear, not the least due to perspective foreshortening and visibility problems. M5 may support both, the detection of global trends (through Moran s I, semivariograms or trend surfaces) as well as local patterns (local point pattern statistics such as G i *). R7 (weighting of display with frequency of occurrence): As the symbology of the map in Figures 1 and 2 shows, M1 meets R7. M2 to M4 all rely on the display of dominant variants. Thus it is not possible to visualize directly the frequency of occurrence of the variants. However, it is possible to use the frequencies as weights in the computation of the dominant variant in M2 and the intensities of dominant variants in M3 and M4, respectively. Similarly, M5 can also not directly display count frequencies, but can use them as weights in geostatistical analysis. R8 (display reliability / uncertainty of observations): The situation for this requirement is similar to that of R7. M1 is the only method that has the potential to construct point symbols that could directly display reliability of observations (e.g. through change of saturation), although no example of this is shown in this paper. M2 to M5, again will have a problem displaying reliability as a separate visual variable (particularly the areaclass methods M2 and M3), but in all methods, reliability / uncertainty could be used as a weight in establishing quantitative values such as variant intensities. R9 (attractiveness of map): All maps seem attractive, each in its own way. M3, for instance, might be particularly attractive in the interactive version, which allows visual interactive exploration. M3 and M5 may be particularly attractive because they very vividly show global vs. local patterns of linguistic variation. R10 (include base map): All examples of M1 (Fig. 1 to Fig. 3) include base maps. While base maps would be theoretically possible (and advisable!) for all methods, only point symbols can be easily combined with base maps, while it is definitely more demanding to overlay linear or areal data on a base map. The geostatical map (M5) of Figure 10 is essentially also a point symbol map. Hence, the same applies as to M1. R11 (provide quantitative measures for statistical testing): M1 to M4 focus on visualization, and hence are useful primarily for visual, exploratory analysis rather than confirmatory, statistical analysis. Since M3 and M4 use KDE, which generates smooth gradients, fitting calculating derivatives of the intensity surface (e.g. gradient, curvature) could be envisioned. M5 is built for analysis: maps are only a by-product of geostatical analysis. Hence, M5 (geostatistics) really represents the culmination in an analytical sense: After visualization of the type of M1 to M4 have been used, and hypotheses formulated, analysis methods from geostatistics and statistics are used to falsify/verify these hypotheses. Further visualizations such as the one shown in Figure 10 may then be shown to further explore and communicate the results of geostatistical analysis.
16 Conclusions Starting off from the identification of the peculiarities of linguistic data we have defined an extensive set of requirements, which visualizations of linguistic data should meet (with the constraint, though, that they are used to map variations of single linguistic features). We have then presented five different cartographic visualization techniques. The five methods were chosen so they jointly span a broad scope of visualization options, and so they can be used to explore linguistic data in multiple ways, hypothesize about spatial patterns of linguistic variation, and finally test and confirm such linguistic hypotheses. On the basis of data from the Syntactical Atlas of Swiss German Dialects (SADS) we then demonstrated these visualization techniques, which provided the basis for an in-depth evaluation and discussion of the methods regarding the requirements defined initially. Not surprisingly, none of the presented techniques meets all of our requirements (which partially represent trade-offs). However, each of them has its distinctive strengths, so that in combination of several visualizations, optimal results should be achieved. We used point symbol maps as a baseline, since they are used in the current production of the SADS publication, which is nearing completion. The key advantage is that the original observations can be mapped without further processing and alteration by some statistical procedure; the map image is not biased by the potential effects imposed by some quantitative method (e.g. the excessive smoothing effect visible in Figure 8). Also, point symbols can be designed and configured in almost unlimited ways, and they can be easily combined with other map information (e.g. base map). Voronoi polygons of dominant variants provide an easy way to generate area-class maps and to get a first picture of the areal coverage of a linguistic feature. As in points symbol maps, no further processing takes place. On the other hand, they are susceptible to noise in the original data (again like point symbol maps). Area-class maps that are based on intensities from KDE introduce a smooth interpolation function that allows extending local trends across neighborhoods, panes away small outliers and noise, and particularly brings out the big picture. Also, it offers a method to interpolate values where they are missing. 3-D views are based on the intensities calculated by the previous technique, but they eliminate the problem of the 2-D techniques that the only dominant variants are rendered. In 3-D trends and breaks in the spatial variation of a linguistic feature, as well as the interplay of different variants, become better visible and can be explored interactively. Finally, geostatistical analysis and mapping techniques represent a whole family of methods that can be used to analyze and later visualize specific patterns of linguistic variation in geographic space on the global and local level. References Bucheli, C. and Glaser, E. (2002) The Syntactic Atlas of Swiss German Dialects: Empirical and Methodological Problems. In: Barbiers, S., Cornips, L. and van der Kleij, S. (eds.) Syntactic Microvariation, Vol. 2. Amsterdam: Meertens Institute Electronic Publications in Linguistics, pp
17 Bucheli Berger, Claudia (2008) Neue Technik, alte Probleme: auf dem Weg zum Syntaktischen Atlas der Deutschen Schweiz (SADS). In St. Elspass, W. König (eds.) Sprachgeographie digital die neue Generation der Sprachatlanten. Hildesheim: Olms (= Germanistische Linguistik ) Bucheli Berger, Claudia, Elvira Glaser and Guido Seiler (forthcoming) Is a syntactic dialectology possible? Contributions from Swiss German. In A. Ender, A. Leemann & B. Wälchli (eds.) Methods in Contemporary Linguistics. Accepted for publication in the Trends in Linguistics Series. Berlin: de Gruyter Mouton. Chambers, J. K. and Trudgill, P. (1998) Dialectology, 2ed. Cambridge: Cambridge University Press. Friedli, M. (2005) Si isch grösser weder ig! Zum Komparativanschluss im Schweizerdeutschen. In: Christen, H. (ed.) Dialektologie an der Jahrtausendwende. Linguistik Online, Vol. 24, No. 3. Available at (last accessed on 15 August 2012). Getis, A. (2010) Spatial Autocorrelation. In: Fischer, M. M. and Getis, A. (ed.) Handbook of Applied Spatial Analysis. Berlin/Heidelberg/New York: Springer. Goebl, H. (2010) Language and Space, Vol. 2: Language and Mapping. Berlin: De Gruyter Mouton, pp Grieve, J., Speelman, D. and Geeraerts, D. (2011) A statistical method for the identification and aggregation of regional linguistic variation. Language Variation and Change, 23, pp Haag, K. (1898) Die Mundarten des oberen Neckar- und Donaulandes. Reutlingen: Buchdruckerei Egon Hutzler. Hotzenköcherle, R. et al. (eds.) ( ) Sprachatlas der Deutschen Schweiz. Volumes I-VIII. Tübingen: Francke. Hotzenköcherle, R. et al. (eds.) (1975) Sprachatlas der Deutschen Schweiz. Volume III. Tübingen: Francke. Hoch S. & Hayes J. J. (2010) Geolinguistics: The Incorporation of Geographic Information Systems and Science. The Geographical Bulletin, 51, 1, pp Lameli, A., Kehrein, R. and Rabanus, S. (2010) Language and Space. Vol. 2: Language Mapping. Berlin: De Gruyter Mouton. Lee, J. & Kretzschmar, Jr., W. (1993) Spatial analysis of linguistic data with GIS functions. Int. Journal of Geographical Information Systems, 7, 6, pp McDavid, R.I., Jr. and O Chain, R. (1980) Linguistic Atlas of the Middle and South Atlantic States. Chicago: University of Chicago Press. Nerbonne, J. (2009) Data-driven Dialectology. Language and Linguistics Compass, 3, 1, pp
18 Nerbonne J. (2010) Language and Space. Vol. 2: Language Mapping. Berlin: De Gruyter Mouton. pp Ord, J. K. and Getis, A. (1995) Local Spatial Autocorrelation Statistics. Distributional Issues and an Application. Geographical Analysis, 27, pp Pi, C.-Y. (2006) Beyond the Isogloss: Isographs in Dialect Topography. Canadian Journal of Linguistics, 51, pp Richner-Steiner, Janine (2011)»E ganz e liebi Frau«Zu den Stellungsvarianten in der adverbiell erweiterten Nominalphrase im Schweizerdeutschen. Eine dialektologische Untersuchung mit quantitativ-geographischem Fokus. PhD Dissertation, German Department, University of Zurich. Rumpf, J. Pickl, S. Elspass, S. König, W. and Schmidt, V. (2009) Structural analysis of dialect maps using methods from spatial statistics. Zeitschrift für Dialektologie und Linguistik, 76, 3, pp Rumpf, J. Pickl, S. Elspass, S. König, W. & Schmidt, V. (2010) Quantification and Statistical Analysis of Structural Similarities in Dialectological Area-class Maps. Dialectologia et Geolinguistica, 18, pp Seiler, G. (2005) Moderne Dialekte neue Dialektologie. Stuttgart: Steiner. pp Sibler, P. (2011) Visualization and Geostatistical Analysis with Data of the Syntactic Atlas of Swiss German Dialects (SADS), Master s Thesis, Zurich: Department of Geography, University of Zurich. Smith, B. and Varzi, A. (2000) Fiat and bona fide boundaries. Philosophy and Phenomenological Research, 60, pp Wattel, E. and van Reenen, P. (2010) Language and Space. Vol. 2: Language Mapping. Berlin: De Gruyter Mouton, pp Pius Sibler, Application Engineer, Intergraph (Switzerland) AG, Neumattstrasse 24, 8953 Dietikon (Switzerland). Robert Weibel, Professor, Department of Geography, University of Zurich, Winterthurerstrasse 190, 8057 Zurich (Switzerland). Elvira Glaser, Professor, German Department, University of Zurich, Schönberggasse 9, 8001 Zurich (Switzerland). Gabriela Bart, Research Assistant, German Department, University of Zurich, Schönberggasse 9, 8001 Zurich (Switzerland).
EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL
What is GIS? Geographic Information Systems Introduction to ArcGIS A database system in which the organizing principle is explicitly SPATIAL For CPSC 178 Visualization: Data, Pixels, and Ideas. What Can
Master s Thesis: ANAMELECHI, FALASY EBERE Analysis of a Raster DEM Creation for a Farm Management Information System based on GNSS and Total Station Coordinates Duration of the Thesis: 6 Months Completion
Data Visualization Techniques and Practices Introduction to GIS Technology Michael Greene Advanced Analytics & Modeling, Deloitte Consulting LLP March 16 th, 2010 Antitrust Notice The Casualty Actuarial
COMPARATIVES WITHOUT DEGREES: A NEW APPROACH FRIEDERIKE MOLTMANN IHPST, Paris firstname.lastname@example.org It has become common to analyse comparatives by using degrees, so that John is happier than Mary would
CO-205 THREE-DIMENSIONAL CARTOGRAPHIC REPRESENTATION AND VISUALIZATION FOR SOCIAL NETWORK SPATIAL ANALYSIS SLUTER C.R.(1), IESCHECK A.L.(2), DELAZARI L.S.(1), BRANDALIZE M.C.B.(1) (1) Universidade Federal
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras email@example.com
Using GIS to Identify Pedestrian-Vehicle Crash Hot Spots and Unsafe Bus Stops Using GIS to Identify Pedestrian- Vehicle Crash Hot Spots and Unsafe Bus Stops Long Tien Truong and Sekhar V. C. Somenahalli
Jointly published by Akadémiai Kiadó, Budapest Scientometrics, and Kluwer Academic Publishers, Dordrecht Vol. 51, No. 3 (2001) 563 571 Utilizing spatial information systems for non-spatial-data analysis
Enhancing Information and Methods for Health System Planning and Research, Institute for Clinical Evaluative Sciences (ICES), January 19-20, 2004, Toronto, Canada Workshop: Using Spatial Analysis and Maps
Using Spatial Statistics In GIS K. Krivoruchko a and C.A. Gotway b a Environmental Systems Research Institute, 380 New York Street, Redlands, CA 92373-8100, USA b Centers for Disease Control and Prevention;
USING SELF-ORGANIZING MAPS FOR INFORMATION VISUALIZATION AND KNOWLEDGE DISCOVERY IN COMPLEX GEOSPATIAL DATASETS Koua, E.L. International Institute for Geo-Information Science and Earth Observation (ITC).
Geographically weighted visualization interactive graphics for scale-varying exploratory analysis Chris Brunsdon 1, Jason Dykes 2 1 Department of Geography Leicester University Leicester LE1 7RH firstname.lastname@example.org
Geography 4203 / 5203 GIS Modeling Class (Block) 9: Variogram & Kriging Some Updates Today class + one proposal presentation Feb 22 Proposal Presentations Feb 25 Readings discussion (Interpolation) Last
A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION Zeshen Wang ESRI 380 NewYork Street Redlands CA 92373 Zwang@esri.com ABSTRACT Automated area aggregation, which is widely needed for mapping both natural
REGIONAL SEDIMENT MANAGEMENT: A GIS APPROACH TO SPATIAL DATA ANALYSIS Lynn Copeland Hardegree, Jennifer M. Wozencraft 1, Rose Dopsovic 2 ABSTRACT: Regional sediment management (RSM) requires the capability
LIBER QUARTERLY, ISSN 1435-5205 LIBER 1999. All rights reserved K.G. Saur, Munich. Printed in Germany Digital Cadastral Maps in Land Information Systems by PIOTR CICHOCINSKI ABSTRACT This paper presents
DATA VISUALIZATION GABRIEL PARODI STUDY MATERIAL: PRINCIPLES OF GEOGRAPHIC INFORMATION SYSTEMS AN INTRODUCTORY TEXTBOOK CHAPTER 7 Contents GIS and maps The visualization process Visualization and strategies
Obesity in America: A Growing Trend David Todd P e n n s y l v a n i a S t a t e U n i v e r s i t y Utilizing Geographic Information Systems (GIS) to explore obesity in America, this study aims to determine
SPATIAL DATA ANALYSIS P.L.N. Raju Geoinformatics Division Indian Institute of Remote Sensing, Dehra Dun Abstract : Spatial analysis is the vital part of GIS. Spatial analysis in GIS involves three types
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
ART A. PROGRAM RATIONALE AND PHILOSOPHY Art education is concerned with the organization of visual material. A primary reliance upon visual experience gives an emphasis that sets it apart from the performing
AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS William V. Harper 1 and Isobel Clark 2 1 Otterbein College, United States of America 2 Alloa Business Centre, United Kingdom email@example.com
Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use non-cartographic data displays Display linking by object highlighting Dynamic Query Object classification
perceptual edge Introduction to Geographical Data Visualization Stephen Few, Perceptual Edge Visual Business Intelligence Newsletter March/April 2009 The important stories that numbers have to tell often
3D Visualization of Seismic Activity Associated with the Nazca and South American Plate Subduction Zone (Along Southwestern Chile) Using RockWorks Table of Contents Figure 1: Top of Nazca plate relative
Principles of Data Visualization for Exploratory Data Analysis Renee M. P. Teate SYS 6023 Cognitive Systems Engineering April 28, 2015 Introduction Exploratory Data Analysis (EDA) is the phase of analysis
TOPOGRAPHIC MAPS MAP 2-D REPRESENTATION OF THE EARTH S SURFACE TOPOGRAPHIC MAP A graphic representation of the 3-D configuration of the earth s surface. This is it shows elevations (third dimension). It
Tax Parcel Mapping Visual Representations of Legal Descriptions and So Much More Topics I. E-Distribution II. GIS & Tax Mapping III. Tax Mapping Procedures IV. Deeds, Property Descriptions, & You! I. E-Distribution
COMP 5318 Data Exploration and Analysis Chapter 3 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
Exploratory Data Analysis for Ecological Modelling and Decision Support Gennady Andrienko & Natalia Andrienko Fraunhofer Institute AIS Sankt Augustin Germany http://www.ais.fraunhofer.de/and 5th ECEM conference,
Visualisation Developing a very high resolution DEM of South Africa by Adriaan van Niekerk, Stellenbosch University DEMs are used in many applications, including hydrology [1, 2], terrain analysis ,
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
Interactive comment on Total cloud cover from satellite observations and climate models by P. Probst et al. Anonymous Referee #1 (Received and published: 20 October 2010) The paper compares CMIP3 model
Objectives Tutorial 8 Raster Data Analysis This tutorial is designed to introduce you to a basic set of raster-based analyses including: 1. Displaying Digital Elevation Model (DEM) 2. Slope calculations
14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines
PROCESS STANDARDS To help New Mexico students achieve the Content Standards enumerated below, teachers are encouraged to base instruction on the following Process Standards: Problem Solving Build new mathematical
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
Tutorial 3 - Map Symbology in ArcGIS Introduction ArcGIS provides many ways to display and analyze map features. Although not specifically a map-making or cartographic program, ArcGIS does feature a wide
VISUALIZATION STRATEGIES AND TECHNIQUES FOR HIGH-DIMENSIONAL SPATIO- TEMPORAL DATA Summary B. Schmidt, U. Streit and Chr. Uhlenküken University of Münster Institute of Geoinformatics Robert-Koch-Str. 28
Reading Questions Week two Lo and Yeung, 2007: 2 19. Schuurman, 2004: Chapter 1. 1. What distinguishes data from information? How are data represented? 2. What sort of problems are GIS designed to solve?
Page 1 of 10 GIS 101 - Introduction to Geographic Information Systems Last Revision or Approval Date - 9/8/2011 College of the Canyons SECTION A 1. Division: Mathematics and Science 2. Department: Earth,
Application of GIS in Transportation Planning: The Case of Riyadh, the Kingdom of Saudi Arabia Mezyad Alterkawi King Saud University, Kingdom of Saudi Arabia * Abstract This paper is intended to illustrate
Color to Grayscale Conversion with Chrominance Contrast Yuting Ye University of Virginia Figure 1: The sun in Monet s Impression Sunrise has similar luminance as the sky. It can hardly be seen when the
3D Data Visualization / Casey Reas Large scale data visualization offers the ability to see many data points at once. By providing more of the raw data for the viewer to consume, visualization hopes to
Nominal Asset Land Valuation Technique by GIS Tahsin YOMRALIOGLU and Recep NISANCI, Turkey Keywords: Land Valuation, Nominal Asset, GIS SUMMARY Land valuation is the process of assessing the characteristics
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
Path Loss Radio Wave Propagation The wireless radio channel puts fundamental limitations to the performance of wireless communications systems Radio channels are extremely random, and are not easily analyzed
Capacity Constrained Smart Grid Design Smart Devices Smart Networks Smart Planning EDX Wireless Tel: +1-541-345-0019 I Fax: +1-541-345-8145 I firstname.lastname@example.org I www.edx.com Mark Chapman and Greg Leon EDX Wireless
The Spatiotemporal Visualization of Historical Earthquake Data in Yellowstone National Park Using ArcGIS GIS and GPS Applications in Earth Science Paul Stevenson, pts453 December 2013 Problem Formulation
Demographics of Atlanta, Georgia: A Visual Analysis of the 2000 and 2010 Census Data 36-315 Final Project Rachel Cohen, Kathryn McKeough, Minnar Xie & David Zimmerman Ethnicities of Atlanta Figure 1: From
W H I T E P A P E R Volumetric Measure Using Geospatial Technology Contents 1. Introduction... 1 2. Project Setup/Triangulation... 1 3. Workflow One: Extract DSM Terrain File... 1 3.1. Stereo Terrain Editing...
Chapter 5 Clustering & Visualization Clustering in high-dimensional databases is an important problem and there are a number of different clustering paradigms which are applicable to high-dimensional data.
Urban Land Use Data for the Telecommunications Industry Regine Richter, Ulla Weingart & Tobias Wever GAF AG Arnulfstr. 197 D-80634 Munich & Ulrike Kähny E-Plus Mobilfunk GmbH & Co. KG E-Plus Platz D-40468
Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,
Using Social Media Data to Assess Spatial Crime Hotspots 1 Introduction Nick Malleson 1 and Martin Andreson 2 1 School of Geography, University of Leeds 2 School of Criminology, Simon Fraser University,
GIS Digital Humanities Boot Camp Series GIS Fundamentals GIS Fundamentals Definition of GIS A geographic information system (GIS) is used to describe and characterize spatial data for the purpose of visualizing
PREDICTIVE ANALYTICS VS. HOTSPOTTING A STUDY OF CRIME PREVENTION ACCURACY AND EFFICIENCY EXECUTIVE SUMMARY For the last 20 years, Hot Spots have become law enforcement s predominant tool for crime analysis.
IMPLEMENTING EXPLORATORY SPATIAL DATA ANALYSIS METHODS FOR MULTIVARIATE HEALTH STATISTICS Daniel B. Haug 1, Alan M. MacEachren 2, Francis P. Boscoe 1, David Brown 1, Mark Marra 1, Colin Polsky 1, and Jaishree
Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute
Galaxy Morphological Classification Jordan Duprey and James Kolano Abstract To solve the issue of galaxy morphological classification according to a classification scheme modelled off of the Hubble Sequence,
INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS C&PE 940, 17 October 2005 Geoff Bohling Assistant Scientist Kansas Geological Survey email@example.com 864-2093 Overheads and other resources available
1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful
Topographic Change Detection Using CloudCompare Version 1.0 Emily Kleber, Arizona State University Edwin Nissen, Colorado School of Mines J Ramón Arrowsmith, Arizona State University Introduction CloudCompare
Interactive Data Visualization 07 Visualization Techniques for Geospatial Data IDV 2015/2016 Notice n Author t João Moura Pires (firstname.lastname@example.org) n This material can be freely used for personal or academic
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
Information Contents of High Resolution Satellite Images H. Topan, G. Büyüksalih Zonguldak Karelmas University K. Jacobsen University of Hannover, Germany Keywords: satellite images, mapping, resolution,
Visualization Quick Guide A best practice guide to help you find the right visualization for your data WHAT IS DOMO? Domo is a new form of business intelligence (BI) unlike anything before an executive
MAPPING DRUG OVERDOSE IN ADELAIDE Danielle Taylor GIS Specialist GISCA, The National Key Centre for Social Applications of GIS University of Adelaide Roslyn Clermont Corporate Information Officer SA Ambulance
Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88) Introduction The National Educational Longitudinal Survey (NELS:88) followed students from 8 th grade in 1988 to 10 th grade in
PREDICTIVE ANALYTICS vs HOT SPOTTING A STUDY OF CRIME PREVENTION ACCURACY AND EFFICIENCY 2014 EXECUTIVE SUMMARY For the last 20 years, Hot Spots have become law enforcement s predominant tool for crime
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Natural Neighbour Interpolation DThe Natural Neighbour method is a geometric estimation technique that uses natural neighbourhood regions generated around each point in the data set. The method is particularly
Virtual prototype of the camera lens defined in . Besides the lenses we model only those mechanical parts that potentially contribute the most to stray light Hunting Ghosts STRAY LIGHT ANALYSIS IN IMAGING
Simple maths for keywords Adam Kilgarriff Lexical Computing Ltd email@example.com Abstract We present a simple method for identifying keywords of one corpus vs. another. There is no one-sizefits-all
iemss 2008: International Congress on Environmental Modelling and Software Integrating Sciences and Information Technology for Environmental Assessment and Decision Making 4 th Biennial Meeting of iemss,
Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes
The Equity Premium in India Rajnish Mehra University of California, Santa Barbara and National Bureau of Economic Research January 06 Prepared for the Oxford Companion to Economics in India edited by Kaushik
This is Geospatial Analysis II: Raster Data, chapter 8 from the book Geographic Information System Basics (index.html) (v. 1.0). This book is licensed under a Creative Commons by-nc-sa 3.0 (http://creativecommons.org/licenses/by-nc-sa/
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR Andrew Goldstein Yale University 68 High Street New Haven, CT 06511 firstname.lastname@example.org Alexander Thornton Shawn Kerrigan Locus Energy 657 Mission St.
QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,
TECHNISCHE UNIVERSITÄT DRESDEN Fakultät Wirtschaftswissenschaften Dresdner Beiträge zu Quantitativen Verfahren Nr. 37/04 BLUEs for Default Probabilities von Konstantin Vogl und Robert Wania Herausgeber: