Jan-18-2022, 01:47 AM
Part 1.B: Accomplished (Prepping my 1611 scrapes [folders & files]) files list; for parsing the HTML to Python and then Payload to MariaDB (Part 2)
Source/Tutorials:
https://askubuntu.com/questions/537967/a...-using-sed
IRC Network - Libera.Chat - #linux:
loganlee: sed append syntax - all lines w/ "/index.html" using the above domain's example with a sed error (with me trying to make it work).
Solution:
Does anyone know how to cp -R using a list like 1611_3.txt ?
This way I can extract a copy of only the 1611 folders (the website is rather huge and the 1611 portion is much smaller than the whole WGET download copy).
Once I get it extracted; I should be able to start writing the python script where I will be needing help, I am sure!
Thank you everyone for this forum!
Best Regards,
Brandon Kastning
Source/Tutorials:
https://askubuntu.com/questions/537967/a...-using-sed
IRC Network - Libera.Chat - #linux:
loganlee: sed append syntax - all lines w/ "/index.html" using the above domain's example with a sed error (with me trying to make it work).
Solution:
WGET-11.02.2021.www.kingjamesbibleonline.org/sed_1611_3$ sed 's/\(.*\)/\1\/index.html/g' 1611_3.txt >1611_4_sed4.txtSample of "1611_4_sed4.txt":
www.kingjamesbibleonline.org/Luke-Chapter-24_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Iohn_13_1611/index.html www.kingjamesbibleonline.org/The-Epistle-to-the-Romanes_12_1611/index.html www.kingjamesbibleonline.org/Reuelation_21_1611/index.html www.kingjamesbibleonline.org/Prouerbs_22_1611/index.html www.kingjamesbibleonline.org/Ecclesiastes_3_1611/index.html www.kingjamesbibleonline.org/1-Corinthians_13_1611/index.html www.kingjamesbibleonline.org/Psalmes_16_1611/index.html www.kingjamesbibleonline.org/Ephesians_5_1611/index.html www.kingjamesbibleonline.org/Discussion-Thread-101611/index.html www.kingjamesbibleonline.org/John-Chapter-15_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Discussion-Thread-161149/index.html www.kingjamesbibleonline.org/Revelation-Chapter-22_Original-1611-KJV/index.html www.kingjamesbibleonline.org/2-Samuel-Chapter-22_Original-1611-KJV/index.html www.kingjamesbibleonline.org/1-Chronicles-Chapter-16_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Job-Chapter-19_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Job-Chapter-22_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-23_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Romans-Chapter-12_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Revelation-Chapter-21_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Galatians-Chapter-6_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Isaiah-Chapter-26_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Colossians-Chapter-3_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Lamentations-Chapter-3_Original-1611-KJV/index.html www.kingjamesbibleonline.org/2-Corinthians-Chapter-3_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Luke-Chapter-12_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-5_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Romans-Chapter-3_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-84_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Isaiah-Chapter-64_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Romans-Chapter-6_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Hebrews-Chapter-4_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Matthew-Chapter-16_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-9_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Matthew-Chapter-19_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Acts-Chapter-20_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Mark-Chapter-9_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Matthew-Chapter-22_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Hebrews-Chapter-13_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Romans-Chapter-10_Original-1611-KJV/index.html www.kingjamesbibleonline.org/James-Chapter-4_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-55_Original-1611-KJV/index.html www.kingjamesbibleonline.org/1-Corinthians-Chapter-15_Original-1611-KJV/index.html www.kingjamesbibleonline.org/Psalms-Chapter-103_Original-1611-KJV/index.htmlNow I need to learn how to take the original list of directories (not with the appended /index.html) -- using this list for the python script, beautifulsoup4 and loops for *parsing*.
Does anyone know how to cp -R using a list like 1611_3.txt ?
This way I can extract a copy of only the 1611 folders (the website is rather huge and the 1611 portion is much smaller than the whole WGET download copy).
Once I get it extracted; I should be able to start writing the python script where I will be needing help, I am sure!
Thank you everyone for this forum!
Best Regards,
Brandon Kastning
“And one of the elders saith unto me, Weep not: behold, the Lion of the tribe of Juda, the Root of David, hath prevailed to open the book,...” - Revelation 5:5 (KJV)
“And oppress not the widow, nor the fatherless, the stranger, nor the poor; and ...” - Zechariah 7:10 (KJV)
#LetHISPeopleGo
“And oppress not the widow, nor the fatherless, the stranger, nor the poor; and ...” - Zechariah 7:10 (KJV)
#LetHISPeopleGo