Initial travel knowledge graph release
This commit is contained in:
212
docs/reports/new2_current_kg_schema_snapshot.md
Normal file
212
docs/reports/new2_current_kg_schema_snapshot.md
Normal file
@@ -0,0 +1,212 @@
|
||||
# new2 当前知识图谱 Schema 快照
|
||||
|
||||
生成时间:2026-05-28
|
||||
项目目录:`/Users/xuexue/new2`
|
||||
|
||||
## 1. 当前配置
|
||||
|
||||
| 项 | 当前值 | 说明 |
|
||||
|---|---|---|
|
||||
| PostgreSQL schema | `kg_admin_new2` | 系统管理库、候选知识、采集任务、空间 POI 表 |
|
||||
| 默认 FalkorDB 图谱 | `guiyang_new2` | `/admin/plaza/graph` 图谱浏览器默认查询的图 |
|
||||
| 空间采集图谱 | `guiyang_spatial_v1` | 高德 H3 网格采集、附近问答使用的数据源 |
|
||||
|
||||
## 2. PostgreSQL 本体 Schema
|
||||
|
||||
当前 `kg_admin_new2.ontology_schemas` 只有 1 个 active schema:
|
||||
|
||||
| id | 名称 | 状态 | 版本 |
|
||||
|---:|---|---|---:|
|
||||
| 2 | 贵阳地理知识图谱Schema | active | 1 |
|
||||
|
||||
实体类型:
|
||||
|
||||
| Entity Type | 说明 |
|
||||
|---|---|
|
||||
| `Area` | 地理区域 |
|
||||
| `Place` | 地点/站点 |
|
||||
|
||||
关系类型:
|
||||
|
||||
| Relation Type | Source | Target | 说明 |
|
||||
|---|---|---|---|
|
||||
| `LOCATED_IN` | `Area` | `Area` | 区域位于区域 |
|
||||
| `LOCATED_IN` | `Place` | `Area` | 地点位于区域 |
|
||||
|
||||
字段:
|
||||
|
||||
| Entity | Fields |
|
||||
|---|---|
|
||||
| `Area` | `name`, `code`, `description`, `level`, `district`, `lat`, `lng` |
|
||||
| `Place` | `name`, `city`, `district`, `lat`, `lng`, `station_type`, `station_class`, `is_transfer` |
|
||||
|
||||
结论:PostgreSQL 本体 Schema 仍然是早期的“地理区域 + 地点”版本,没有完整体现后续设计的 `Entity / Event / Concept / Relation / Statement`,也没有完整体现高德空间 POI 分类。
|
||||
|
||||
## 3. 默认图谱 `guiyang_new2`
|
||||
|
||||
节点标签:
|
||||
|
||||
| Label | Count |
|
||||
|---|---:|
|
||||
| `Place` | 26156 |
|
||||
| `Area` | 1612 |
|
||||
| `ExperienceTag` | 337 |
|
||||
| `BusLine` | 234 |
|
||||
| `Literal` | 10 |
|
||||
| `Event` | 9 |
|
||||
| `RouteTemplate` | 7 |
|
||||
| `Concept` | 6 |
|
||||
| `NaturalFeature` | 5 |
|
||||
| `ScenicSpot` | 2 |
|
||||
| `Facility` | 1 |
|
||||
| `CulturalSite` | 1 |
|
||||
| `MemorialSite` | 1 |
|
||||
|
||||
关系类型:
|
||||
|
||||
| Relation Type | Count |
|
||||
|---|---:|
|
||||
| `HAS_TAG` | 665 |
|
||||
| `STOPS_AT` | 4775 |
|
||||
| `LOCATED_IN` | 1 |
|
||||
| `HAS_ADDRESS` | 1 |
|
||||
| `HAS_CLIMATE` | 1 |
|
||||
| `HAS_OPENING_HOURS` | 1 |
|
||||
| `HAS_SCENIC_LEVEL` | 1 |
|
||||
| `HAS_TICKET_PRICE` | 1 |
|
||||
| `HAS_AREA` | 1 |
|
||||
| `HAS_REPUTATION` | 1 |
|
||||
| `HAS_SUGGESTED_DURATION` | 1 |
|
||||
| `HAS_BEST_SEASON` | 1 |
|
||||
| `FORMER_NAME` | 1 |
|
||||
| `HAS_PART` | 7 |
|
||||
| `HAS_EVENT` | 9 |
|
||||
| `HAS_CONCEPT` | 6 |
|
||||
| `NEARBY_ATTRACTION` | 2 |
|
||||
| `NEAR_TRANSIT` | 7 |
|
||||
|
||||
结论:这个图已经有 `Event`、`Concept`、`Literal`,但主要是花溪公园/旧采集数据和交通线等混合数据;不是新的高德 H3 全量空间图谱。
|
||||
|
||||
## 4. 空间图谱 `guiyang_spatial_v1`
|
||||
|
||||
节点标签:
|
||||
|
||||
| Label | Count |
|
||||
|---|---:|
|
||||
| `Place` | 40011 |
|
||||
| `POI` | 8590 |
|
||||
| `FoodPlace` | 8251 |
|
||||
| `GeoCell` | 3637 |
|
||||
| `Mall` | 3192 |
|
||||
| `EnterprisePlace` | 3017 |
|
||||
| `TransitFacility` | 2772 |
|
||||
| `LifeServicePlace` | 2692 |
|
||||
| `GovernmentPlace` | 2320 |
|
||||
| `ResidentialPlace` | 2146 |
|
||||
| `EducationPlace` | 1851 |
|
||||
| `Hotel` | 1794 |
|
||||
| `MedicalPlace` | 1681 |
|
||||
| `ScenicSpot` | 1026 |
|
||||
| `Facility` | 679 |
|
||||
| `Area` | 8 |
|
||||
|
||||
关系类型:
|
||||
|
||||
| Relation Type | Count |
|
||||
|---|---:|
|
||||
| `LOCATED_IN` | 40011 |
|
||||
| `IN_H3_R9` | 40011 |
|
||||
| `PART_OF` | 7 |
|
||||
|
||||
说明:标签计数会有重叠,例如一个美食 POI 可以同时带 `Place` 和 `FoodPlace`。
|
||||
|
||||
## 5. 空间 POI 表 Schema
|
||||
|
||||
表:`kg_admin_new2.amap_spatial_pois`
|
||||
|
||||
关键字段:
|
||||
|
||||
| 字段 | 类型 | 作用 |
|
||||
|---|---|---|
|
||||
| `graph_name` | text | 图谱名,例如 `guiyang_spatial_v1` |
|
||||
| `gaode_poi_id` | text | 高德 POI ID |
|
||||
| `element_id` | text | 图谱节点 ID |
|
||||
| `name` | text | 名称 |
|
||||
| `type_label` | text | 中文业务分类,例如 美食、酒店、景点 |
|
||||
| `place_type` | text | 内部分类,例如 eat、hotel、sight |
|
||||
| `amap_type`, `typecode` | text | 高德原始分类 |
|
||||
| `lng`, `lat` | double precision | 经纬度 |
|
||||
| `h3_r6` ~ `h3_r10` | text | 多分辨率 H3 空间索引 |
|
||||
| `province`, `city`, `district`, `adcode` | text | 行政区划 |
|
||||
| `address`, `tel`, `open_time`, `rating`, `cost`, `level`, `tags` | text | POI 属性 |
|
||||
| `photo_urls` | jsonb | 高德照片 URL 数组 |
|
||||
| `source_cell_id`, `source_resolution`, `source_scope_adcode` | text/int | 采集来源网格和行政区 |
|
||||
| `raw_jsonb` | jsonb | 高德原始返回 |
|
||||
| `first_fetched_at`, `last_fetched_at` | timestamptz | 首次/最近采集时间 |
|
||||
|
||||
当前 `guiyang_spatial_v1` 分类数量:
|
||||
|
||||
| 中文分类 | 内部类型 | Count |
|
||||
|---|---|---:|
|
||||
| 美食 | eat | 8217 |
|
||||
| 商场 | mall | 3045 |
|
||||
| 公司企业 | enterprise | 3016 |
|
||||
| 交通设施 | transit | 2772 |
|
||||
| 生活服务 | life | 2609 |
|
||||
| 政府机构 | government | 2309 |
|
||||
| 商务住宅 | residential | 2138 |
|
||||
| 体育休闲 | poi | 1972 |
|
||||
| 汽车服务 | poi | 1899 |
|
||||
| 科教文化 | education | 1838 |
|
||||
| 酒店 | hotel | 1779 |
|
||||
| 医疗保健 | medical | 1680 |
|
||||
| 汽车维修 | poi | 1266 |
|
||||
| 地名地址 | poi | 1110 |
|
||||
| 景点 | sight | 1014 |
|
||||
| 金融保险 | poi | 975 |
|
||||
| 公共设施 | facility | 679 |
|
||||
| 汽车销售 | poi | 663 |
|
||||
| 摩托车服务 | poi | 628 |
|
||||
| 道路附属 | poi | 42 |
|
||||
|
||||
## 6. 当前问题判断
|
||||
|
||||
1. `/admin/modeling/schema` 可以查看本体 Schema 页面,但目前只显示 GraphRelease 版本列表,不是完整 Schema 浏览器。
|
||||
2. `ontology_schemas` 里的 active schema 仍然偏旧,只覆盖 `Area / Place / LOCATED_IN`。
|
||||
3. 真正支撑“附近问答”的是 `guiyang_spatial_v1` + `amap_spatial_pois`,它的空间 Schema 已经有 H3 字段和分类标签。
|
||||
4. 后续如果要严肃做通用知识图谱,需要把 PostgreSQL 本体 Schema 升级为统一版本,至少包含:
|
||||
- `Place / POI / Area / GeoCell`
|
||||
- `FoodPlace / Hotel / ScenicSpot / TransitFacility / MedicalPlace / Mall / LifeServicePlace / EducationPlace`
|
||||
- `Event / Concept / Literal / Evidence / Statement`
|
||||
- `LOCATED_IN / IN_H3_R9 / HAS_EVENT / HAS_CONCEPT / HAS_ATTRIBUTE / NEAR_TRANSIT / PART_OF / SAME_AS`
|
||||
|
||||
## 7. 常用查看命令
|
||||
|
||||
查看默认图谱标签:
|
||||
|
||||
```bash
|
||||
docker exec zn-kg-new2-falkordb redis-cli -p 6379 GRAPH.QUERY guiyang_new2 \
|
||||
"CALL db.labels() YIELD label RETURN label" --raw
|
||||
```
|
||||
|
||||
查看默认图谱关系类型:
|
||||
|
||||
```bash
|
||||
docker exec zn-kg-new2-falkordb redis-cli -p 6379 GRAPH.QUERY guiyang_new2 \
|
||||
"CALL db.relationshipTypes() YIELD relationshipType RETURN relationshipType" --raw
|
||||
```
|
||||
|
||||
查看空间图谱标签:
|
||||
|
||||
```bash
|
||||
docker exec zn-kg-new2-falkordb redis-cli -p 6379 GRAPH.QUERY guiyang_spatial_v1 \
|
||||
"CALL db.labels() YIELD label RETURN label" --raw
|
||||
```
|
||||
|
||||
查看空间 POI 表结构:
|
||||
|
||||
```bash
|
||||
docker exec zn-kg-new2-postgres psql -U admin -d kg_admin \
|
||||
-c "\\d kg_admin_new2.amap_spatial_pois"
|
||||
```
|
||||
|
||||
Reference in New Issue
Block a user