Harvest for BioImages, the virtual fieldguide, UK
Created: 19 Dec 05:07

Stage: completed
Fetched: 19 Dec 05:07
Validated: 19 Dec 05:07
Deltas Created: 19 Dec 05:07
Units Normalized: 19 Dec 05:32
Ancestry Built: 19 Dec 05:14
Nodes Matched: 19 Dec 05:32
Names Parsed: 19 Dec 05:14
New Models Stored: 19 Dec 05:12
Indexed: 19 Dec 05:32
Completed: 19 Dec 05:51
Time to Harvest: 1 minute

Harvesting Log

(216 lines)
[INFO] [2023-12-19 05:07:30] Created harvest instance #4511
[STOP] [2023-12-19 05:07:30] create_harvest_instance
[START] [2023-12-19 05:07:30] fetch_files
[STOP] [2023-12-19 05:07:30] fetch_files
[START] [2023-12-19 05:07:30] validate_each_file
[INFO] [2023-12-19 05:07:30] Created new folder: /app/public/converted_csv
[INFO] [2023-12-19 05:07:30] Looping over 3 formats...
[INFO] [2023-12-19 05:07:30] ...agents (/app/public/data/bio_imgs_dw_bioi/agent.tab)
[INFO] [2023-12-19 05:07:30] Valid: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_agents_31021.csv (3 lines)
[INFO] [2023-12-19 05:07:30] ...nodes (/app/public/data/bio_imgs_dw_bioi/taxon.tab)
[INFO] [2023-12-19 05:07:31] Valid: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_nodes_31023.csv (23065 lines)
[INFO] [2023-12-19 05:07:31] ...media (/app/public/data/bio_imgs_dw_bioi/media_resource.tab)
[INFO] [2023-12-19 05:07:43] Valid: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_media_31022.csv (146605 lines)
[STOP] [2023-12-19 05:07:43] validate_each_file
[START] [2023-12-19 05:07:43] convert_to_csv
[INFO] [2023-12-19 05:07:43] Looping over 3 formats...
[INFO] [2023-12-19 05:07:43] ...agents (/app/public/data/bio_imgs_dw_bioi/agent.tab)
[CMD] [2023-12-19 05:07:43] /usr/bin/sort /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_agents_31021.csv > /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_agents_31021.csv_sorted
[INFO] [2023-12-19 05:07:43] Converted: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_agents_31021.csv (3 lines)
[INFO] [2023-12-19 05:07:43] ...nodes (/app/public/data/bio_imgs_dw_bioi/taxon.tab)
[CMD] [2023-12-19 05:07:43] /usr/bin/sort /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_nodes_31023.csv > /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_nodes_31023.csv_sorted
[INFO] [2023-12-19 05:07:44] Converted: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_nodes_31023.csv (23065 lines)
[INFO] [2023-12-19 05:07:44] ...media (/app/public/data/bio_imgs_dw_bioi/media_resource.tab)
[CMD] [2023-12-19 05:07:44] /usr/bin/sort /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_media_31022.csv > /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_media_31022.csv_sorted
[INFO] [2023-12-19 05:07:46] Converted: /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_media_31022.csv (146605 lines)
[STOP] [2023-12-19 05:07:46] convert_to_csv
[START] [2023-12-19 05:07:46] calculate_delta
[INFO] [2023-12-19 05:07:46] Created diff dir: /app/public/diff
[INFO] [2023-12-19 05:07:46] Looping over 3 formats...
[INFO] [2023-12-19 05:07:46] ...agents (/app/public/data/bio_imgs_dw_bioi/agent.tab)
[CMD] [2023-12-19 05:07:46] echo "0a" > /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_agents_31021.diff
[CMD] [2023-12-19 05:07:46] tail -n +1 /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_agents_31021.csv >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_agents_31021.diff
[CMD] [2023-12-19 05:07:46] echo "." >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_agents_31021.diff
[INFO] [2023-12-19 05:07:46] Created diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_agents_31021.diff (5 lines)
[INFO] [2023-12-19 05:07:46] ...nodes (/app/public/data/bio_imgs_dw_bioi/taxon.tab)
[CMD] [2023-12-19 05:07:46] echo "0a" > /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_nodes_31023.diff
[CMD] [2023-12-19 05:07:46] tail -n +1 /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_nodes_31023.csv >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_nodes_31023.diff
[CMD] [2023-12-19 05:07:47] echo "." >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_nodes_31023.diff
[INFO] [2023-12-19 05:07:47] Created diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_nodes_31023.diff (23067 lines)
[INFO] [2023-12-19 05:07:47] ...media (/app/public/data/bio_imgs_dw_bioi/media_resource.tab)
[CMD] [2023-12-19 05:07:47] echo "0a" > /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_media_31022.diff
[CMD] [2023-12-19 05:07:47] tail -n +1 /app/public/data/bio_imgs_dw_bioi/converted_csv/bio_imgs_dw_bioi_media_31022.csv >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_media_31022.diff
[CMD] [2023-12-19 05:07:48] echo "." >> /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_media_31022.diff
[INFO] [2023-12-19 05:07:48] Created diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_media_31022.diff (146607 lines)
[STOP] [2023-12-19 05:07:48] calculate_delta
[START] [2023-12-19 05:07:48] parse_diff_and_store
[INFO] [2023-12-19 05:07:48] Handling diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_agents_31021.diff (5 lines)
[INFO] [2023-12-19 05:07:48] Loading agents diff file into memory (5 lines)...
[INFO] [2023-12-19 05:07:48] Storing 3 Attributions (3/3/5)
[INFO] [2023-12-19 05:07:48] Handling diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_nodes_31023.diff (23067 lines)
[INFO] [2023-12-19 05:07:48] Loading nodes diff file into memory (23067 lines)...
[WARN] [2023-12-19 05:07:49] Filtered Scientific Name `Oulema melanopus/rufocyanea agg.` to `Oulema melanopusrufocyanea agg.`
[WARN] [2023-12-19 05:07:49] Filtered Scientific Name `Melangyna compositarum/labiatarum` to `Melangyna compositarumlabiatarum`
[WARN] [2023-12-19 05:07:50] Filtered Scientific Name `Acronicta psi/tridens` to `Acronicta psitridens`
[WARN] [2023-12-19 05:07:50] Filtered Scientific Name `Phyllonorycter froelichiella/kleemanella/rajella` to `Phyllonorycter froelichiellakleemanellarajella`
[WARN] [2023-12-19 05:07:50] Filtered Scientific Name `Hydropsyche fulvipes/instabilis` to `Hydropsyche fulvipesinstabilis`
[WARN] [2023-12-19 05:07:50] Filtered Scientific Name `Entoloma indutoides var.  griseorubidum (Noordel.) Noordel., W` to `Entoloma indutoides var. griseorubidum (Noordel.) Noordel., W`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Acleris laterana/schalleriana` to `Acleris lateranaschalleriana`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Archips podana/operana` to `Archips podanaoperana`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Phigalia pilosaria / Philereme transversata` to `Phigalia pilosaria Philereme transversata`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Thymelicus lineola / sylvestris` to `Thymelicus lineola sylvestris`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Spilosoma lubricipedia / luteum` to `Spilosoma lubricipedia luteum`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Eriogaster lanestris / Malacosoma neustria` to `Eriogaster lanestris Malacosoma neustria`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Dypterygia scabriuscula / Hyloicus pinastri` to `Dypterygia scabriuscula Hyloicus pinastri`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Nemapogon cloacella / Schiffermuelleria similella` to `Nemapogon cloacella Schiffermuelleria similella`
[WARN] [2023-12-19 05:07:51] Filtered Scientific Name `Podosphaera clandestina/spiraeae` to `Podosphaera clandestinaspiraeae`
[WARN] [2023-12-19 05:07:52] Filtered Scientific Name `Carlina/Carduus/Cirsium` to `CarlinaCarduusCirsium`
[INFO] [2023-12-19 05:07:52] Storing 11901 ScientificNames (23802/10000/23067)
[INFO] [2023-12-19 05:07:55] Storing 11901 Nodes (23802/10000/23067)
[WARN] [2023-12-19 05:08:00] Filtered Scientific Name `Cephalosporium balanoides Drechsler,  1941` to `Cephalosporium balanoides Drechsler, 1941`
[WARN] [2023-12-19 05:08:00] Filtered Scientific Name `Leveillula clavata   poinsettia powdery mildew (Leveillula   clavata` to `Leveillula clavata poinsettia powdery mildew (Leveillula clavata`
[WARN] [2023-12-19 05:08:01] Filtered Scientific Name `Araniella cucurbitina/opisthographa` to `Araniella cucurbitinaopisthographa`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Syrphus vitripennis/rectus` to `Syrphus vitripennisrectus`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Cordyceps canadensis/capitata` to `Cordyceps canadensiscapitata`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Littorina obtusata/fabalis` to `Littorina obtusatafabalis`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Chamaepsila nigricornis/rosae` to `Chamaepsila nigricornisrosae`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Meteorus abdominator/longipilosus` to `Meteorus abdominatorlongipilosus`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Centaurea nigra sens. lat. (=nigra/debeauxii)` to `Centaurea nigra sens. lat. (=nigradebeauxii)`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Crangonyx pseudogracilis/floridanus s.l.` to `Crangonyx pseudogracilisfloridanus s.l.`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Stagnicola palustris/fuscus/corvus` to `Stagnicola palustrisfuscuscorvus`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Xanthogramma pedissequum/stackelbergi` to `Xanthogramma pedissequumstackelbergi`
[WARN] [2023-12-19 05:08:02] Filtered Scientific Name `Tenthredopsis nassata/scutellaris (males)` to `Tenthredopsis nassatascutellaris (males)`
[WARN] [2023-12-19 05:08:02] (Reached filtered-name limit; supressing further warnings.)
[WARN] [2023-12-19 05:08:03] SKIPPED 1289 Scientific names (48510/20000/23067) with resource_pks already be in the database!
[WARN] [2023-12-19 05:08:03] SKIPPED 1289 Nodes (48510/20000/23067) with resource_pks already be in the database!
[INFO] [2023-12-19 05:08:03] Storing 11065 ScientificNames (48510/20000/23067)
[INFO] [2023-12-19 05:08:07] Storing 11065 Nodes (48510/20000/23067)
[WARN] [2023-12-19 05:08:12] SKIPPED 1028 Scientific names (56978/23065/23067) with resource_pks already be in the database!
[WARN] [2023-12-19 05:08:12] SKIPPED 1028 Nodes (56978/23065/23067) with resource_pks already be in the database!
[INFO] [2023-12-19 05:08:12] Storing 3206 ScientificNames (56978/23065/23067)
[INFO] [2023-12-19 05:08:13] Storing 3206 Nodes (56978/23065/23067)
[INFO] [2023-12-19 05:08:14] Handling diff: /app/public/data/bio_imgs_dw_bioi/diff/bio_imgs_dw_bioi_media_31022.diff (146607 lines)
[INFO] [2023-12-19 05:08:14] Loading media diff file into memory (146607 lines)...
[INFO] [2023-12-19 05:08:29] Storing 17553 ContentAttributions (29997/10000/146607)
[INFO] [2023-12-19 05:08:31] Storing 2445 ArticlesSections (29997/10000/146607)
[INFO] [2023-12-19 05:08:31] Storing 2445 Articles (29997/10000/146607)
[INFO] [2023-12-19 05:08:32] Storing 7554 Media (29997/10000/146607)
[INFO] [2023-12-19 05:08:45] Storing 20000 ContentAttributions (59997/20000/146607)
[INFO] [2023-12-19 05:08:47] Storing 10000 Media (59997/20000/146607)
[INFO] [2023-12-19 05:09:01] Storing 20000 ContentAttributions (89997/30000/146607)
[INFO] [2023-12-19 05:09:03] Storing 10000 Media (89997/30000/146607)
[INFO] [2023-12-19 05:09:17] Storing 20000 ContentAttributions (119997/40000/146607)
[INFO] [2023-12-19 05:09:18] Storing 10000 Media (119997/40000/146607)
[INFO] [2023-12-19 05:09:32] Storing 20000 ContentAttributions (149997/50000/146607)
[INFO] [2023-12-19 05:09:34] Storing 10000 Media (149997/50000/146607)
[INFO] [2023-12-19 05:09:48] Storing 20000 ContentAttributions (179997/60000/146607)
[INFO] [2023-12-19 05:09:50] Storing 10000 Media (179997/60000/146607)
[INFO] [2023-12-19 05:10:04] Storing 20000 ContentAttributions (209997/70000/146607)
[INFO] [2023-12-19 05:10:06] Storing 10000 Media (209997/70000/146607)
[INFO] [2023-12-19 05:10:20] Storing 20000 ContentAttributions (239997/80000/146607)
[INFO] [2023-12-19 05:10:22] Storing 10000 Media (239997/80000/146607)
[INFO] [2023-12-19 05:10:36] Storing 20000 ContentAttributions (269997/90000/146607)
[INFO] [2023-12-19 05:10:38] Storing 10000 Media (269997/90000/146607)
[INFO] [2023-12-19 05:10:52] Storing 20000 ContentAttributions (299997/100000/146607)
[INFO] [2023-12-19 05:10:53] Storing 10000 Media (299997/100000/146607)
[INFO] [2023-12-19 05:11:08] Storing 20000 ContentAttributions (329997/110000/146607)
[INFO] [2023-12-19 05:11:10] Storing 10000 Media (329997/110000/146607)
[INFO] [2023-12-19 05:11:24] Storing 20000 ContentAttributions (359997/120000/146607)
[INFO] [2023-12-19 05:11:26] Storing 10000 Media (359997/120000/146607)
[INFO] [2023-12-19 05:11:40] Storing 20000 ContentAttributions (389997/130000/146607)
[INFO] [2023-12-19 05:11:42] Storing 10000 Media (389997/130000/146607)
[INFO] [2023-12-19 05:11:58] Storing 10969 ContentAttributions (419997/140000/146607)
[INFO] [2023-12-19 05:11:59] Storing 969 Media (419997/140000/146607)
[INFO] [2023-12-19 05:12:00] Storing 9031 ArticlesSections (419997/140000/146607)
[INFO] [2023-12-19 05:12:00] Storing 9031 Articles (419997/140000/146607)
[INFO] [2023-12-19 05:12:14] Storing 6606 ContentAttributions (439815/146605/146607)
[INFO] [2023-12-19 05:12:15] Storing 6606 ArticlesSections (439815/146605/146607)
[INFO] [2023-12-19 05:12:15] Storing 6606 Articles (439815/146605/146607)
[STOP] [2023-12-19 05:12:17] parse_diff_and_store
[START] [2023-12-19 05:12:17] resolve_keys
[2023-12-19 05:12:40] Resolving downloaded urls (this is not actually downloading them yet)
[INFO] [2023-12-19 05:13:51] Occurrences to nodes (through scientific_names)...
[INFO] [2023-12-19 05:13:51] traits to occurrences...
[INFO] [2023-12-19 05:13:51] traits to nodes (through occurrences)...
[INFO] [2023-12-19 05:13:51] Traits to sex term...
[INFO] [2023-12-19 05:13:51] Traits to lifestage term...
[INFO] [2023-12-19 05:13:51] MetaTraits to traits...
[INFO] [2023-12-19 05:13:51] MetaTraits (simple, measurement row refers to parent) to traits...
[INFO] [2023-12-19 05:13:51] Assocs to occurrences...
[INFO] [2023-12-19 05:13:51] Assocs to nodes...
[INFO] [2023-12-19 05:13:51] Assoc to sex term...
[INFO] [2023-12-19 05:13:51] Assoc to lifestage term...
[INFO] [2023-12-19 05:13:51] MetaAssoc to assocs...
[STOP] [2023-12-19 05:14:07] resolve_keys
[START] [2023-12-19 05:14:07] hold_for_later_1
[STOP] [2023-12-19 05:14:07] hold_for_later_1
[START] [2023-12-19 05:14:07] hold_for_later_2
[STOP] [2023-12-19 05:14:07] hold_for_later_2
[START] [2023-12-19 05:14:07] resolve_missing_parents
[STOP] [2023-12-19 05:14:07] resolve_missing_parents
[START] [2023-12-19 05:14:07] rebuild_nodes
[START] [2023-12-19 05:14:07] Flattener#flatten
[START] [2023-12-19 05:14:07] Flattener#study_resource
[START] [2023-12-19 05:14:07] Flattener#build_ancestry
[STOP] [2023-12-19 05:14:09] Flattener#build_ancestry
[INFO] [2023-12-19 05:14:09] 26172 ancestry keys
[START] [2023-12-19 05:14:09] build_node_ancestors
[INFO] [2023-12-19 05:14:09] old ancestors deleted.
[STOP] [2023-12-19 05:14:13] build_node_ancestors
[START] [2023-12-19 05:14:19] Flattener#propagate_ancestor_ids
[STOP] [2023-12-19 05:14:22] Flattener#propagate_ancestor_ids
[STOP] [2023-12-19 05:14:22] Flattener#flatten
[STOP] [2023-12-19 05:14:22] rebuild_nodes
[START] [2023-12-19 05:14:22] resolve_missing_media_owners
[STOP] [2023-12-19 05:14:22] resolve_missing_media_owners
[START] [2023-12-19 05:14:22] sanitize_media_verbatims
[STOP] [2023-12-19 05:14:22] sanitize_media_verbatims
[START] [2023-12-19 05:14:22] queue_downloads
[STOP] [2023-12-19 05:14:22] queue_downloads
[START] [2023-12-19 05:14:22] parse_names
[WARN] [2023-12-19 05:14:22] I see 26172 names which still need to be parsed.
[INFO] [2023-12-19 05:14:23] 0% of media downloaded
[WARN] [2023-12-19 05:14:24] Names to parse: 10000 formatted: 10000 learned: 9551 parsed: 10000
[INFO] [2023-12-19 05:14:25] 0% of media downloaded
[WARN] [2023-12-19 05:14:33] Names to parse: 10000 formatted: 10000 learned: 9830 parsed: 10000
[WARN] [2023-12-19 05:14:43] Names to parse: 6172 formatted: 6172 learned: 6132 parsed: 6172
[STOP] [2023-12-19 05:14:49] parse_names
[START] [2023-12-19 05:14:49] denormalize_canonical_names_to_nodes
[STOP] [2023-12-19 05:14:50] denormalize_canonical_names_to_nodes
[START] [2023-12-19 05:14:50] match_nodes
[START] [2023-12-19 05:14:51] map_all_nodes_to_pages
[STOP] [2023-12-19 05:31:52] map_all_nodes_to_pages
[INFO] [2023-12-19 05:31:52] 5794 Unmatched nodes (of 26172)! That's too many to output. Full list in /app/public/data/bio_imgs_dw_bioi/unmatched_nodes.txt ; First 10: Canonical: Noctuidae; Node#145313461; ResourceID: BI-taxon-100; Canonical: Lacanobia suasa; Node#145315354; ResourceID: BI-taxon-108132; Canonical: Agrochola lychnidis; Node#145315414; ResourceID: BI-taxon-108534; Canonical: Plusiinae; Node#145324168; ResourceID: BI-taxon-15403; Canonical: Shargacucullia verbasci; Node#145327519; ResourceID: BI-taxon-162102; Canonical: Pseudoips prasinana; Node#145327534; ResourceID: BI-taxon-162118; Canonical: Mythimna ferrago; Node#145330214; ResourceID: BI-taxon-27828; Canonical: Chilodes maritimus; Node#145332887; ResourceID: BI-taxon-37419; Canonical: Cucullia gnaphalii occidentalis; Node#145332891; ResourceID: BI-taxon-37426; Canonical: Euxoa obelisca grisea; Node#145332895; ResourceID: BI-taxon-37439
[START] [2023-12-19 05:31:52] update_nodes
[STOP] [2023-12-19 05:32:08] update_nodes
[STOP] [2023-12-19 05:32:08] match_nodes
[START] [2023-12-19 05:32:08] reindex_search
[STOP] [2023-12-19 05:32:38] reindex_search
[START] [2023-12-19 05:32:38] normalize_units
[STOP] [2023-12-19 05:32:38] normalize_units
[START] [2023-12-19 05:32:38] calculate_statistics
[INFO] [2023-12-19 05:35:01] Duplicate page_id count: 0
[STOP] [2023-12-19 05:35:01] calculate_statistics
[START] [2023-12-19 05:35:02] complete_harvest_instance
[START] [2023-12-19 05:35:02] overall_tsv_creation
[INFO] [2023-12-19 05:35:02] Exporting 26172 nodes as TSV in batches of 10000...
[INFO] [2023-12-19 05:35:02] Processing group of 26172 in 3 batches of 10000
[INFO] [2023-12-19 05:40:24] Processed 10000/26172 nodes
[INFO] [2023-12-19 05:47:31] Processed 20000/26172 nodes
[INFO] [2023-12-19 05:51:25] Processed 26172/26172 nodes
[INFO] [2023-12-19 05:51:25] Average Time: 234.143
[INFO] [2023-12-19 05:51:25] Total Time: 16m23s
[STOP] [2023-12-19 05:51:25] overall_tsv_creation
[INFO] [2023-12-19 05:51:25] Done. Check your files:
[INFO] [2023-12-19 05:51:25] (26172 lines) /app/public/data/bio_imgs_dw_bioi/publish_nodes.tsv
[INFO] [2023-12-19 05:51:25] (88391 lines) /app/public/data/bio_imgs_dw_bioi/publish_node_ancestors.tsv
[INFO] [2023-12-19 05:51:25] (26172 lines) /app/public/data/bio_imgs_dw_bioi/publish_scientific_names.tsv
[INFO] [2023-12-19 05:51:25] (128523 lines) /app/public/data/bio_imgs_dw_bioi/publish_media.tsv
[INFO] [2023-12-19 05:51:25] (18082 lines) /app/public/data/bio_imgs_dw_bioi/publish_articles.tsv
[INFO] [2023-12-19 05:51:26] (18330 lines) /app/public/data/bio_imgs_dw_bioi/publish_image_info.tsv
[INFO] [2023-12-19 05:51:26] (275128 lines) /app/public/data/bio_imgs_dw_bioi/publish_attributions.tsv
[INFO] [2023-12-19 05:51:26] (18082 lines) /app/public/data/bio_imgs_dw_bioi/publish_content_sections.tsv
[STOP] [2023-12-19 05:51:27] complete_harvest_instance
[START] [2023-12-19 05:51:27] completed
[STOP] [2023-12-19 05:51:27] completed
[STOP] [2023-12-19 05:51:27] logged process, took 2636.57
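
The START/STOP pairs in the log above can be turned into per-step durations to see where the run spent its time. The following is a minimal sketch, not part of the harvester: the function name step_durations and the regular expression are assumptions made for illustration, and it relies only on the "[LEVEL] [timestamp] message" line shape seen in this log. STOP lines with no matching START (for example create_harvest_instance) are ignored.

```python
import re
from datetime import datetime

# Assumed line shape, taken from the log above: [START|STOP] [YYYY-MM-DD HH:MM:SS] step_name
LINE_RE = re.compile(
    r"^\[(START|STOP)\] \[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\] (.+)$"
)


def step_durations(lines):
    """Return {step_name: seconds} for every START/STOP pair found (hypothetical helper)."""
    started = {}
    durations = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # INFO, WARN, CMD and untagged lines are skipped
        kind, stamp, name = match.groups()
        when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")
        if kind == "START":
            started[name] = when
        elif name in started:
            # Pair this STOP with the most recent START of the same name
            durations[name] = (when - started.pop(name)).total_seconds()
    return durations


# Four lines copied from the log above
sample = [
    "[START] [2023-12-19 05:14:51] map_all_nodes_to_pages",
    "[STOP] [2023-12-19 05:31:52] map_all_nodes_to_pages",
    "[START] [2023-12-19 05:35:02] overall_tsv_creation",
    "[STOP] [2023-12-19 05:51:25] overall_tsv_creation",
]

durations = step_durations(sample)
for name, secs in sorted(durations.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {secs:.0f} s")
```

For these four lines the sketch reports 1021 s for map_all_nodes_to_pages and 983 s for overall_tsv_creation; the latter agrees with the "Total Time: 16m23s" line in the log.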
