Python Web Scraping for Threat Intelligence

Arya Adhi Nugraha (1*), Domy Kristomo (2)

(1) Universitas Teknologi Digital Indonesia (UTDI), Indonesia
(2) Universitas Teknologi Digital Indonesia (UTDI), Indonesia
(*) Corresponding Author

Abstract


Cyber threats evolve continuously, which makes it difficult for organizations to maintain robust cybersecurity defenses. Effective gathering and analysis of threat intelligence data is therefore central to improving situational awareness and informing proactive security measures. This paper explores the use of Python web scraping for threat intelligence, with a focus on extracting information from the Cybersecurity and Infrastructure Security Agency (CISA) website. A Python web scraping script is developed and implemented, and the process of systematically gathering threat intelligence data with it is examined, highlighting how automation streamlines the collection and analysis of real-time threat data. The results indicate that the script supports rapid aggregation of threat intelligence from diverse online sources and gives security professionals actionable insights for strengthening their defenses. The ethical and legal implications of web scraping are also addressed, with emphasis on responsible data collection practices. Overall, the study suggests that Python web scraping is a valuable tool for improving cybersecurity resilience as threats evolve.
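The abstract does not reproduce the script itself, so the following is only a minimal sketch of the kind of workflow it describes: check a robots.txt policy, fetch a page, and extract advisory links. It uses only the Python standard library (libraries such as Requests and BeautifulSoup are common alternatives). The path prefix `/news-events/`, the sample HTML, the sample robots.txt, the user-agent string, and all function and class names are illustrative assumptions, not details taken from the paper or from the CISA site.

```python
from html.parser import HTMLParser
from urllib import request, robotparser
from urllib.parse import urljoin

BASE_URL = "https://www.cisa.gov"        # site named in the abstract
ADVISORY_PATH_PREFIX = "/news-events/"   # assumption: hypothetical path filter


class LinkExtractor(HTMLParser):
    """Collect (title, absolute URL) pairs for <a> tags whose href starts with a prefix."""

    def __init__(self, base_url, path_prefix):
        super().__init__()
        self.base_url = base_url
        self.path_prefix = path_prefix
        self.links = []
        self._href = None   # href of the matching anchor currently open, if any
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href and href.startswith(self.path_prefix):
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace in the link text.
            title = " ".join("".join(self._text).split())
            self.links.append((title, urljoin(self.base_url, self._href)))
            self._href = None


def extract_advisories(html, base_url=BASE_URL, path_prefix=ADVISORY_PATH_PREFIX):
    """Return a list of (title, url) tuples found in an HTML string."""
    parser = LinkExtractor(base_url, path_prefix)
    parser.feed(html)
    parser.close()
    return parser.links


def is_allowed(robots_txt, url, user_agent="*"):
    """Check a URL against the text of a robots.txt file."""
    rules = robotparser.RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules.can_fetch(user_agent, url)


def fetch(url, timeout=10):
    """Download a page as text (not called below; requires network access)."""
    # The user-agent string is a hypothetical placeholder.
    req = request.Request(url, headers={"User-Agent": "threat-intel-sketch/0.1"})
    with request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


# Offline sample data so the sketch runs without touching the live site.
SAMPLE_HTML = """
<ul>
  <li><a href="/news-events/alerts/example-alert-1">Example Alert One</a></li>
  <li><a href="/about">About</a></li>
  <li><a href="/news-events/alerts/example-alert-2">Example   Alert Two</a></li>
</ul>
"""
SAMPLE_ROBOTS = "User-agent: *\nDisallow: /private/\n"

if __name__ == "__main__":
    for title, url in extract_advisories(SAMPLE_HTML):
        if is_allowed(SAMPLE_ROBOTS, url):
            print(title, url)
```

Keeping the parsing step separate from the network fetch lets the extraction logic be exercised on saved pages, and the robots.txt check reflects the responsible-collection point raised in the abstract. Against a live site, the real markup and the site's actual robots.txt and terms of use would need to be checked first.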


DOI: https://doi.org/10.30645/kesatria.v5i2.381

DOI (PDF): https://doi.org/10.30645/kesatria.v5i2.381.g378
