Skip to content

Commit

Permalink
Reorganized folder structure
Browse files Browse the repository at this point in the history
  • Loading branch information
bfeist committed Apr 4, 2015
1 parent c431b00 commit 0f3507b
Show file tree
Hide file tree
Showing 79 changed files with 14 additions and 11 deletions.
17 changes: 10 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,8 +18,8 @@ Ben Feist ([email protected])
<li>Process CSV OCR output with Python scripts (multiple one-time operations to clean various issues) (complete) (more here http://benfeist.com/digitizing-apollo-17-part-5-python-processing/ )</li>
<li>Reconstruct entire mission timeline in Adobe Premiere laying in air-to-ground audio from Internet Archive and television video from NASA History office. (complete - pending last 5% of source material digitized by JSC Audio Lab). (more here http://benfeist.com/digitizing-apollo-17-part-6-timeline-reconstruction/ ) All Premiere source footage can be obtained using Bittorrent Sync. Sync hash key: BBHUF5R7EPR5N3KPHBSHHWNHZYAB5YY2V</li>
<li>Listen to reconstruction timeline. Correct transcript of each utterance including timestamp, transcriptions errors from 1972, and OCR errors. (complete) (more here http://benfeist.com/digitizing-apollo-17-part-7-listening-in-real-time/ )</li>
<li>Render all Premiere Pro video segments that were created for timecode purposes, and upload all 36, 8 hour segments (125GB) to YouTube. YouTube Channel containing these videos: https://www.youtube.com/channel/UC3pGYbJCfrINT1DNBJMxC2Q/videos
<li>Generate HTML output from corrected utterance CSV. (complete)</li>
<li>Render all Premiere Pro video segments that were created for timecode purposes, and upload all 39, 8 hour segments (125GB) to YouTube. YouTube Channel containing these videos: https://www.youtube.com/channel/UC3pGYbJCfrINT1DNBJMxC2Q/videos
<li>Generate HTML transcript from corrected utterance CSV. (complete)</li>
</ol>
Current Status:
<ul>
Expand All @@ -28,15 +28,18 @@ Current Status:
Future Steps:
<ul>
<li>Generate HTML for AFJ</li>
<li>Generate MC output for Spacelog.org</li>
<li>Generate MC output for Spacelog.org</li>
<li>Integrate imagery into apollo17.org</li>
</ul>

The "HTML_scripts/output" folder contains the apollo17.org website itself in alpha form.
The "_Website/_webroot" folder contains the apollo17.org website itself.

The "Abbyy Image OCR" folder contains a project that can be opened using ABBYY FineReader 11 Pro. This is where the body of the conversion work is being done.
The "! Previous Steps/OCR/Abbyy Image OCR" folder contains a project that can be opened using ABBYY FineReader 11 Pro. This is where the body of the conversion work is being done.

The "OCR_Output" folder contains pipe-delimited CSV files that are direct outputs from FineReader. These CSV files are quite dirty, as many as 100 pages were completely misread by finereader due to the typing being tilted in the scans.
The "! Previous Steps/OCR/OCR_Output" folder contains pipe-delimited CSV files that are direct outputs from FineReader. These CSV files are quite dirty, as many as 100 pages were completely misread by finereader due to the typing being tilted in the scans.

The "Processing_Scripts" folder contains Python scripts that were written to scrub the OCR CSV output. These scripts are changed often to assist with whatever portion of the cleaning process is currently being addressed. They perform tasks such as timestamp processing, checking of callsigns, merging dialog lines that are split across pages in the typewritten originals, etc.

The "MC_Output" folder contains the output from scripts like makeTEC_MCFromRawCSV.py in the "Processing_Scripts" folder. This "MC" output is a format that's usable by Spacelog.org.
The "_MC_Output" folder contains the output from scripts like makeTEC_MCFromRawCSV.py in the "Processing_Scripts" folder. This "MC" output is a format that's usable by Spacelog.org.

The "_AFJ" folder contains the output of all transcript data into HTML compatible with the Apollo Flight Journal.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
8 changes: 4 additions & 4 deletions HTML_Scripts/createAllHTML.py → _Website/createAllHTML.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,14 @@
import csv
from quik import FileLoader

output_TOC_file_name_and_path = "./output/TOC.html"
output_TOC_file_name_and_path = "./_webroot/TOC.html"
output_TOC_file = open(output_TOC_file_name_and_path, "w")
output_TOC_file.write("")
output_TOC_file.close()

output_TOC_file = open(output_TOC_file_name_and_path, "a")

output_TOC_index_file_name_and_path = "./output/TOCindex.csv"
output_TOC_index_file_name_and_path = "./_webroot/TOCindex.csv"
output_TOC_index_file = open(output_TOC_index_file_name_and_path, "w")
output_TOC_index_file.write("")
output_TOC_index_file.close()
Expand Down Expand Up @@ -49,14 +49,14 @@


## -------------------- Write Utterance HTML
output_utterance_file_name_and_path = "./output/allUtterances.html"
output_utterance_file_name_and_path = "./_webroot/allUtterances.html"
output_utterance_file = open(output_utterance_file_name_and_path, "w")
output_utterance_file.write("")
output_utterance_file.close()

output_utterance_file = open(output_utterance_file_name_and_path, "a")

output_utterance_index_file_name_and_path = "./output/utteranceIndex.csv"
output_utterance_index_file_name_and_path = "./_webroot/utteranceIndex.csv"
output_utterance_index_file = open(output_utterance_index_file_name_and_path, "w")
output_utterance_index_file.write("")
output_utterance_index_file.close()
Expand Down
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
Binary file added _Website/wireframes.pptx
Binary file not shown.

0 comments on commit 0f3507b

Please sign in to comment.