Skip to content

Jack005/bigbang

 
 

Repository files navigation

BigBang

BigBang is a toolkit for studying communications data from collaborative projects.

The focus of the early milestones will be on mailing list or listserve analysis, with a focus on Mailman and Sourceforge mail archives, as well as .mbox files.

Installation

BigBang uses a lot of SciPy packages that use native (e.g. C) code. This complicates installation.

There are two package management systems you can use for installing Python projects. I recommend using conda for installation. You can also use pip.

You will need to install Graphviz separately, as this is currently not supportedcross-platform either by pip or Anaconda.

conda installation

The Anaconda Python distribution which comes with scientific packages pre-installed and the conda package management system.

  1. Install Graphviz based on the website's instructions for your operating system.

  2. Clone this repository:

    git clone git@github.com:sbenthall/bigbang.git

  3. Make a new conda environment called bigbang

    conda create -n bigbang python

  4. Run this script to install the dependencies

    cd bigbang

    bash conda-setup.sh

You should be good to go.

pip installation

You're going to have to manually install a bunch of programs if you do it this way. You can try following these instructions for installing various Scientific Python packages using Homebrew and pip.

These are some of the packages you will need to install. You can discover others ones you will need be seeing where pip chokes.

When you think you've got what you need installed, follow these instructions:

  1. Clone this repo.

  2. (Optional) Make a new virtualenv.

  3. Install the remaining dependencies to the virtual environment using pip.

    pip install -r requirements.txt

  4. cd into the directory of the cloned repo and run

    python setup.py develop

Collecting from Mailman

BigBang comes with a script for collecting files from public Mailman web archives. And example of this is the scipy-dev mailing list page.

From the directory of the this checked out repository (i.e. you may need to cd bigbang), you can collect the archives from a web URL with the following command:

python bin/collect_mail.py -u http://mail.scipy.org/pipermail/scipy-dev/

You can also give this command a file with several urls, one per line. One of these is provided in the examples/ directory.

python bin/collect_mail.py -f examples/urls.txt

Once the data has been collected, BigBang has functions to support analysis.

Using BigBang

BigBang is presently an environment for scientific exploration of mailing list data.

The best way to learn about how to use BigBang is to look at the I Python notebook examples provided in this repository. In the home directory of this repository, run:

source activate bigbang
ipython notebook examples/

and play around.

About

listserve analysis toolkit

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 99.4%
  • Shell 0.6%