Contents Menu Expand Light mode Dark mode Auto light/dark, in light mode Auto light/dark, in dark mode Skip to content
Documentation for World Historical Gazetteer Latest Release
Logo
Documentation for World Historical Gazetteer Latest Release
  • Introduction
  • Guides & Tutorials
    • 1. Our Indexes
    • 2. Workbench
    • 3. Publishing Data
    • 4. Uploading Data
    • 5. Reconciliation & Accessioning
    • 6. Reviewing accessioning results
    • 7. Collection Groups
  • Technical
    • 1. Repositories
    • 2. APIs
    • 3. Issues
  • Development Roadmap
    • v3.5: Toponym Phonetics
      • 1. Overview
      • 2. Components
      • 3. Data Flow
      • 4. Elasticsearch Index Design
      • 5. Training the Siamese BiLSTM
      • 6. Query Pipeline
      • 7. Push-Based Synchronisation Strategy
      • 8. Advantages of This Architecture
      • 9. Monitoring & Observability
      • 10. Future Extensions
      • 11. Deployment Plan
      • 12. Risk Assessment
      • 13. Success Criteria
      • 14. Summary
      • 15. References
    • V4: Graph Datamodel
      • 1. User Guide
        • 1.1.1. Quick Start Guide
        • 1.1.2. Understanding WHG Concepts
        • 1.1.3. Place Record Anatomy
        • 1.1.4. Contributing Data Overview
        • 1.1.5. Reconciliation Overview
        • 1.1.6. Tutorial: Creating a Historical Route
        • 1.1.7. Frequently Asked Questions
        • 1.1.8. Glossary
      • 2. Open Educational Resources (OER)
      • 3. Data Model
        • 3.1. Introduction
        • 3.2. Overview
        • 3.3. Attestations & Relations
        • 3.4. Vocabularies
        • 3.5. Special Thing Patterns
        • 3.6. Contribution Types & Data Formats
        • 3.7. RDF Representation
        • 3.8. Platform Use Cases
        • 3.9. Implementation in ArangoDB
        • 3.10. Summary & Future Directions
      • 4. System Architecture
        • 4.1. Database Technology Assessment
        • 4.2. Kubernetes Configuration
        • 4.3. SSH Key Setup
        • 4.4. Deploying the Management Pod
        • 4.5. Deploying Services
        • 4.6. Service Configuration
  • License
Back to top
View this page
Edit this page
  • Introduction
    • Vision
    • Mission
  • Guides & Tutorials
    • 1. Our Indexes
      • 1.1. Wikidata+GeoNames Index (Augmentation)
      • 1.2. WHG Publication Index (Searchable Publication)
      • 1.3. WHG Union Index (Final Integration & Clustering)
    • 2. Workbench
      • 2.1. Individual datasets
      • 2.2. Multiple datasets
      • 2.3. Thematic place collections
        • 2.3.1. Instructional exercise in a class setting, or workshop
        • 2.3.2. Authored publication
    • 3. Publishing Data
      • 3.1. Create and publish a Place Collection
      • 3.2. Create and publish a Dataset Collection
    • 4. Uploading Data
      • 4.1. Choosing an upload data format: LPF or LP-TSV?
      • 4.2. Preparing data for upload
        • 4.2.1. The simple case
        • 4.2.2. The not so simple case: extracting places
    • 5. Reconciliation & Accessioning
      • 5.1. What does closeMatch mean?
    • 6. Reviewing accessioning results
    • 7. Collection Groups
      • 7.1. Create and manage a Collection Group for a class or workshop
  • Technical
    • 1. Repositories
    • 2. APIs
      • 2.1. Entity API
      • 2.2. Reconciliation Service API
        • 2.2.1. Using the WHG Reconciliation API in OpenRefine
      • 2.3. API Tokens
        • 2.3.1. Using an API Token
    • 3. Issues
  • Development Roadmap
    • v3.5: Toponym Phonetics
      • 1. Overview
      • 2. Components
        • 2.1. Online Components (DigitalOcean)
        • 2.2. Offline Components (Pitt CRC)
        • 2.3. Rationale for Separate Indices
      • 3. Data Flow
        • 3.1. Initial Migration (One-Time, at Pitt)
        • 3.2. Ongoing Ingestion (New Datasets)
        • 3.3. Linking Toponyms to IPA
        • 3.4. Linking IPA to Places (Implicit)
        • 3.5. IPA Generation: Epitran + PanPhon
        • 3.6. IPA Normalisation Rules
      • 4. Elasticsearch Index Design
        • 4.1. IPA Index
        • 4.2. Toponym Index
        • 4.3. Place Index
      • 5. Training the Siamese BiLSTM
        • 5.1. Training Data Construction
        • 5.2. Model Architecture
        • 5.3. Embedding Refresh Cycle
        • 5.4. Real-Time Inference Model
      • 6. Query Pipeline
        • 6.1. Full Pipeline with Fallbacks
        • 6.2. Query-Time Optimisations
        • 6.3. Error Handling
      • 7. Push-Based Synchronisation Strategy
        • 7.1. Rationale
        • 7.2. Bulk Update Workflow
        • 7.3. Authentication & Security
        • 7.4. Resilience Strategy
        • 7.5. Verification Strategy
      • 8. Advantages of This Architecture
      • 9. Monitoring & Observability
        • 9.1. Key Metrics
        • 9.2. Dashboards
        • 9.3. Alerting
      • 10. Future Extensions
      • 11. Deployment Plan
        • 11.1. Phase 1: Development (Week 1-4)
        • 11.2. Phase 2: Initial Migration (Week 5-8)
        • 11.3. Phase 3: Model Training (Week 9-12)
        • 11.4. Phase 4: Production Rollout (Week 13-16)
        • 11.5. Phase 5: Continuous Improvement (Ongoing)
      • 12. Risk Assessment
      • 13. Success Criteria
        • 13.1. Technical Metrics
        • 13.2. User Experience Metrics
        • 13.3. Research Impact
      • 14. Summary
      • 15. References
    • V4: Graph Datamodel
      • 1. User Guide
        • 1.1. Note to Documentation Team
        • 1.2. Getting Help
      • 2. Open Educational Resources (OER)
        • 2.1. Vision
        • 2.2. Strategic Goals
        • 2.3. Technical Requirements
      • 3. Data Model
        • 3.1. Introduction
        • 3.2. Overview
        • 3.3. Attestations & Relations
        • 3.4. Vocabularies
        • 3.5. Special Thing Patterns
        • 3.6. Contribution Types & Data Formats
        • 3.7. RDF Representation
        • 3.8. Platform Use Cases
        • 3.9. Implementation in ArangoDB
        • 3.10. Summary & Future Directions
      • 4. System Architecture
        • 4.1. Database Technology Assessment
        • 4.2. Kubernetes Configuration
        • 4.3. SSH Key Setup
        • 4.4. Deploying the Management Pod
        • 4.5. Deploying Services
        • 4.6. Service Configuration
  • License
    • Creative Commons Attribution-NonCommercial 4.0 International Public License
      • Section 1 – Definitions.
      • Section 2 – Scope.
      • Section 3 – License Conditions.
      • Section 4 – Sui Generis Database Rights.
      • Section 5 – Disclaimer of Warranties and Limitation of Liability.
      • Section 6 – Term and Termination.
      • Section 7 – Other Terms and Conditions.
      • Section 8 – Interpretation.
Copyright ©2017–2025 World Historical Gazetteer
Last updated on 17 November 2025