Welcome to Nowab In Data

نواب فى بيانات

Parliamentary records, reports, written questions, and transcripts of debates are rich sources of data for research in various disciplines. Not only do they enable new lines of research in fields such as political science, sociology, gender studies, and information retrieval, but they can also be an important source of data for natural language processing. For example, this data can be used to gain insight into which topics members of parliament and parties discuss and vote on.

In this context, the website of the Moroccan House of Representatives can be very useful, especially since it provides data in Arabic (mainly Classical Arabic, although debate transcripts may contain text in Dareja). To make this data easier to use, we created a website where we share parliament datasets.

In order to (re-)produce these datasets, we created a Python library called “barlaman”. barlaman is a collection of Python scripts that retrieve data from the Parliament website (web scraping) and from its documents (PDF scraping).
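
To give a rough idea of what the web-scraping step involves, here is a minimal sketch that collects links to PDF documents from an HTML page using only the Python standard library. It does not use barlaman's own API: the names `PdfLinkCollector` and `extract_pdf_links`, the sample HTML, and the `example.org` URLs are all hypothetical and chosen for illustration only.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkCollector(HTMLParser):
    """Collect absolute URLs of <a> links that point to PDF files."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.pdf_links = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags can carry the links we are interested in.
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Keep links whose target ends in ".pdf" (case-insensitive).
        if href and href.lower().endswith(".pdf"):
            # Resolve relative links against the page URL.
            self.pdf_links.append(urljoin(self.base_url, href))


def extract_pdf_links(html, base_url):
    """Return the PDF links found in `html`, in document order."""
    parser = PdfLinkCollector(base_url)
    parser.feed(html)
    return parser.pdf_links


# Hypothetical page fragment; a real page would be downloaded first.
SAMPLE_HTML = """
<ul>
  <li><a href="/docs/session-01.pdf">Session 1</a></li>
  <li><a href="/about">About</a></li>
  <li><a href="https://example.org/files/questions.PDF">Written questions</a></li>
</ul>
"""

links = extract_pdf_links(SAMPLE_HTML, "https://example.org/parliament/")
print(links)
```

In a real pipeline, the HTML would come from an HTTP request, and each collected PDF would then be downloaded and passed to a PDF text extractor.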

Support or Contact

Having trouble with something? Contact us and we’ll help you sort it out.