Harvest for
SoundCloud
Created
10 Jul 16:30
Stage:
completed
Fetched:
10 Jul 16:30
Validated:
10 Jul 16:30
Deltas Created
10 Jul 16:30
Units Normalized:
10 Jul 16:30
Ancestry Built:
10 Jul 16:30
Nodes Matched:
10 Jul 16:30
Names Parsed:
10 Jul 16:30
New Models Stored:
10 Jul 16:30
Indexed:
10 Jul 16:30
Completed:
10 Jul 16:30
Time to Harvest:
less than a minute
Harvesting Log
(133 lines)
# Logfile created on 2019-07-10 16:30:03 -0400 by logger.rb/56815
[START] [2019-07-10 16:30:03] logged process
[START] [2019-07-10 16:30:03] create_harvest_instance
[STOP] [2019-07-10 16:30:04] create_harvest_instance
[START] [2019-07-10 16:30:04] fetch_files
[STOP] [2019-07-10 16:30:04] fetch_files
[START] [2019-07-10 16:30:04] validate_each_file
[STOP] [2019-07-10 16:30:04] validate_each_file
[START] [2019-07-10 16:30:04] convert_to_csv
[CMD] [2019-07-10 16:30:04] /usr/bin/sort /app/public/converted_csv/sound_cloud_agents_14142.csv > /app/public/converted_csv/sound_cloud_agents_14142.csv_sorted
[CMD] [2019-07-10 16:30:04] /usr/bin/sort /app/public/converted_csv/sound_cloud_nodes_14143.csv > /app/public/converted_csv/sound_cloud_nodes_14143.csv_sorted
[CMD] [2019-07-10 16:30:04] /usr/bin/sort /app/public/converted_csv/sound_cloud_media_14144.csv > /app/public/converted_csv/sound_cloud_media_14144.csv_sorted
[STOP] [2019-07-10 16:30:04] convert_to_csv
[START] [2019-07-10 16:30:04] calculate_delta
[CMD] [2019-07-10 16:30:04] echo "0a" > /app/public/diff/sound_cloud_agents_14142.diff
[CMD] [2019-07-10 16:30:04] tail -n +1 /app/public/converted_csv/sound_cloud_agents_14142.csv >> /app/public/diff/sound_cloud_agents_14142.diff
[CMD] [2019-07-10 16:30:04] echo "." >> /app/public/diff/sound_cloud_agents_14142.diff
[CMD] [2019-07-10 16:30:04] echo "0a" > /app/public/diff/sound_cloud_nodes_14143.diff
[CMD] [2019-07-10 16:30:04] tail -n +1 /app/public/converted_csv/sound_cloud_nodes_14143.csv >> /app/public/diff/sound_cloud_nodes_14143.diff
[CMD] [2019-07-10 16:30:04] echo "." >> /app/public/diff/sound_cloud_nodes_14143.diff
[CMD] [2019-07-10 16:30:04] echo "0a" > /app/public/diff/sound_cloud_media_14144.diff
[CMD] [2019-07-10 16:30:04] tail -n +1 /app/public/converted_csv/sound_cloud_media_14144.csv >> /app/public/diff/sound_cloud_media_14144.diff
[CMD] [2019-07-10 16:30:04] echo "." >> /app/public/diff/sound_cloud_media_14144.diff
[STOP] [2019-07-10 16:30:04] calculate_delta
[START] [2019-07-10 16:30:04] parse_diff_and_store
[INFO] [2019-07-10 16:30:04] Loading agents diff file into memory (true lines)...
[INFO] [2019-07-10 16:30:04] Loading nodes diff file into memory (true lines)...
[INFO] [2019-07-10 16:30:04] Loading media diff file into memory (true lines)...
[INFO] [2019-07-10 16:30:04] Storing 1 Attributions
[INFO] [2019-07-10 16:30:04] Processing group of 1 in 1 groups of 1000
[INFO] [2019-07-10 16:30:04] Average Time: 0.0
[INFO] [2019-07-10 16:30:04] Total Time: 1s
[INFO] [2019-07-10 16:30:04] Storing 5 ScientificNames
[INFO] [2019-07-10 16:30:04] Processing group of 5 in 1 groups of 1000
[INFO] [2019-07-10 16:30:04] Average Time: 0.01
[INFO] [2019-07-10 16:30:04] Total Time: 1s
[INFO] [2019-07-10 16:30:04] Storing 5 Nodes
[INFO] [2019-07-10 16:30:04] Processing group of 5 in 1 groups of 1000
[INFO] [2019-07-10 16:30:04] Average Time: 0.01
[INFO] [2019-07-10 16:30:04] Total Time: 1s
[INFO] [2019-07-10 16:30:04] Storing 5 Media
[INFO] [2019-07-10 16:30:04] Processing group of 5 in 1 groups of 1000
[INFO] [2019-07-10 16:30:04] Average Time: 0.0
[INFO] [2019-07-10 16:30:04] Total Time: 1s
[STOP] [2019-07-10 16:30:04] parse_diff_and_store
[START] [2019-07-10 16:30:04] resolve_keys
[INFO] [2019-07-10 16:30:09] Occurrences to nodes (through scientific_names)...
[INFO] [2019-07-10 16:30:09] traits to occurrences...
[INFO] [2019-07-10 16:30:09] traits to nodes (through occurrences)...
[INFO] [2019-07-10 16:30:09] Traits to sex term...
[INFO] [2019-07-10 16:30:09] Traits to lifestage term...
[INFO] [2019-07-10 16:30:09] MetaTraits to traits...
[INFO] [2019-07-10 16:30:09] MetaTraits (simple, measurement row refers to parent) to traits...
[INFO] [2019-07-10 16:30:09] Assocs to occurrences...
[INFO] [2019-07-10 16:30:09] Assocs to nodes...
[INFO] [2019-07-10 16:30:09] Assoc to sex term...
[INFO] [2019-07-10 16:30:09] Assoc to lifestage term...
[STOP] [2019-07-10 16:30:09] resolve_keys
[START] [2019-07-10 16:30:09] hold_for_later_1
[STOP] [2019-07-10 16:30:09] hold_for_later_1
[START] [2019-07-10 16:30:09] hold_for_later_2
[STOP] [2019-07-10 16:30:09] hold_for_later_2
[START] [2019-07-10 16:30:09] resolve_missing_parents
[STOP] [2019-07-10 16:30:09] resolve_missing_parents
[START] [2019-07-10 16:30:09] rebuild_nodes
[START] [2019-07-10 16:30:09] Flattener#flatten
[START] [2019-07-10 16:30:09] Flattener#study_resource
[START] [2019-07-10 16:30:09] Flattener#build_ancestry
[STOP] [2019-07-10 16:30:09] Flattener#build_ancestry
[INFO] [2019-07-10 16:30:09] 5 ancestry keys
[START] [2019-07-10 16:30:09] build_node_ancestors
[INFO] [2019-07-10 16:30:09] old ancestors deleted.
[STOP] [2019-07-10 16:30:09] build_node_ancestors
[WARN] [2019-07-10 16:30:09] Flattener: nothing to flatten! (Completely flat resource?)
[STOP] [2019-07-10 16:30:09] Flattener#flatten
[STOP] [2019-07-10 16:30:09] rebuild_nodes
[START] [2019-07-10 16:30:09] resolve_missing_media_owners
[STOP] [2019-07-10 16:30:09] resolve_missing_media_owners
[START] [2019-07-10 16:30:09] sanitize_media_verbatims
[STOP] [2019-07-10 16:30:09] sanitize_media_verbatims
[START] [2019-07-10 16:30:09] queue_downloads
[STOP] [2019-07-10 16:30:09] queue_downloads
[START] [2019-07-10 16:30:09] parse_names
[WARN] [2019-07-10 16:30:09] I see 5 names which still need to be parsed.
[ERR] [2019-07-10 16:30:09][hdls] download_and_prep FAILED for Medium.find(10051941): 401 Unauthorized
[ERR] [2019-07-10 16:30:10][hdls] http://api.soundcloud.com/tracks/48811632/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051940) resource: SoundCloud (823), PK: 48811632
[ERR] [2019-07-10 16:30:10][hdls] download_and_prep FAILED for Medium.find(10051940): http://api.soundcloud.com/tracks/48811632/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051940) resource: SoundCloud (823), PK: 48811632
[ERR] [2019-07-10 16:30:10][hdls] http://api.soundcloud.com/tracks/35233897/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051938) resource: SoundCloud (823), PK: 35233897
[ERR] [2019-07-10 16:30:10][hdls] download_and_prep FAILED for Medium.find(10051938): http://api.soundcloud.com/tracks/35233897/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051938) resource: SoundCloud (823), PK: 35233897
[STOP] [2019-07-10 16:30:10] parse_names
[START] [2019-07-10 16:30:10] denormalize_canonical_names_to_nodes
[STOP] [2019-07-10 16:30:10] denormalize_canonical_names_to_nodes
[START] [2019-07-10 16:30:10] match_nodes
[START] [2019-07-10 16:30:10] map_all_nodes_to_pages
[ERR] [2019-07-10 16:30:10][hdls] http://api.soundcloud.com/tracks/35321675/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051939) resource: SoundCloud (823), PK: 35321675
[ERR] [2019-07-10 16:30:10][hdls] download_and_prep FAILED for Medium.find(10051939): http://api.soundcloud.com/tracks/35321675/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051939) resource: SoundCloud (823), PK: 35321675
[ERR] [2019-07-10 16:30:10][hdls] http://api.soundcloud.com/tracks/35180212/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051937) resource: SoundCloud (823), PK: 35180212
[STOP] [2019-07-10 16:30:10] map_all_nodes_to_pages
[ERR] [2019-07-10 16:30:10][hdls] download_and_prep FAILED for Medium.find(10051937): http://api.soundcloud.com/tracks/35180212/download?client_id=ac6cdf58548a238e00b7892c031378ce is audio/x-wav, which is unsupported. Medium.find(10051937) resource: SoundCloud (823), PK: 35180212
[INFO] [2019-07-10 16:30:10] Unmatched nodes (1 of 5): Grus canadensis (#44148133)
[START] [2019-07-10 16:30:10] update_nodes
[STOP] [2019-07-10 16:30:10] update_nodes
[STOP] [2019-07-10 16:30:10] match_nodes
[START] [2019-07-10 16:30:10] reindex_search
[STOP] [2019-07-10 16:30:10] reindex_search
[START] [2019-07-10 16:30:10] normalize_units
[STOP] [2019-07-10 16:30:10] normalize_units
[START] [2019-07-10 16:30:10] calculate_statistics
[STOP] [2019-07-10 16:30:10] calculate_statistics
[START] [2019-07-10 16:30:10] complete_harvest_instance
[START] [2019-07-10 16:30:10] overall_tsv_creation
[INFO] [2019-07-10 16:30:10] Processing group of 5 in 1 batches of 10000
[INFO] [2019-07-10 16:30:42] Average Time: 11.09
[INFO] [2019-07-10 16:30:42] Total Time: 32s
[STOP] [2019-07-10 16:30:42] overall_tsv_creation
[INFO] [2019-07-10 16:30:42] Done. Check your files:
[INFO] [2019-07-10 16:30:42] (5 lines) /app/public/data/sound_cloud/publish_nodes.tsv
[INFO] [2019-07-10 16:30:42] (5 lines) /app/public/data/sound_cloud/publish_scientific_names.tsv
[INFO] [2019-07-10 16:30:42] (5 lines) /app/public/data/sound_cloud/publish_media.tsv
[STOP] [2019-07-10 16:30:42] complete_harvest_instance
[START] [2019-07-10 16:30:42] completed
[STOP] [2019-07-10 16:30:42] completed
[STOP] [2019-07-10 16:30:42] logged process, took 38.93
[ERR] [2019-07-11 09:44:54][hdls] download_and_prep FAILED for Medium.find(10051937): undefined local variable or method `raw' for #<MediumPrepper::SaveAndServe:0x00005623ba396a78>
Did you mean? rand
[ERR] [2019-07-11 09:44:54][hdls] download_and_prep FAILED for Medium.find(10051938): undefined local variable or method `raw' for #<MediumPrepper::SaveAndServe:0x00005623b7cea780>
Did you mean? rand
[ERR] [2019-07-11 09:44:55][hdls] download_and_prep FAILED for Medium.find(10051939): undefined local variable or method `raw' for #<MediumPrepper::SaveAndServe:0x00005623ba5ca948>
Did you mean? rand
[ERR] [2019-07-11 09:44:56][hdls] download_and_prep FAILED for Medium.find(10051940): undefined local variable or method `raw' for #<MediumPrepper::SaveAndServe:0x00005623b9f69400>
Did you mean? rand
[ERR] [2019-07-11 09:44:56][hdls] download_and_prep FAILED for Medium.find(10051941): 401 Unauthorized
[ERR] [2019-07-11 10:39:21][hdls] download_and_prep FAILED for Medium.find(10051941): 401 Unauthorized
Latest Process