I hereby claim:
- I am ganesh-srinivas on github.
- I am gsrinivas (https://keybase.io/gsrinivas) on keybase.
- I have a public key ASBDA9vJrkbic6qLa_R93POWmmLCcuxmF_pKgkeo32ZGXwo
To claim this, I am signing this object:
This document tracks progress, ideas and source code for dark data extraction systems. These systems use statistical inference to extract, integrate and clean data from unstructured ("dark") sources such as forum posts and webpages. Data programming is the predominant paradigm for dark data extraction: noisy, possibly conflicting user-defined functions are supplied to a generative model, which recovers the parameters of the labelling process. Wherever possible, my projects are based on Snorkel/DeepDive.
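As a rough illustration of the data programming idea, here is a minimal sketch in plain Python. It does not use the Snorkel or DeepDive APIs. The three labelling functions, the example posts and the "is this a sale listing?" task are all hypothetical, and an unweighted majority vote stands in for the generative model, which would instead learn how much to trust each function.

```python
import re

# Each labelling function votes POSITIVE, NEGATIVE, or ABSTAIN on one text.
ABSTAIN, NEGATIVE, POSITIVE = 0, -1, 1


def lf_mentions_price(text):
    """Hypothetical heuristic: a dollar amount suggests a sale listing."""
    return POSITIVE if re.search(r"\$\d+", text) else ABSTAIN


def lf_for_sale_phrase(text):
    """Hypothetical heuristic: the phrase 'for sale' suggests a listing."""
    return POSITIVE if "for sale" in text.lower() else ABSTAIN


def lf_is_question(text):
    """Hypothetical heuristic: a question is probably not a listing."""
    return NEGATIVE if text.strip().endswith("?") else ABSTAIN


LABELING_FUNCTIONS = [lf_mentions_price, lf_for_sale_phrase, lf_is_question]


def label_matrix(texts, lfs=LABELING_FUNCTIONS):
    """Apply every labelling function to every text (rows = texts)."""
    return [[lf(text) for lf in lfs] for text in texts]


def majority_vote(votes):
    """Simplified stand-in for the generative model: sum the votes.

    A real label model would estimate each function's accuracy and
    weight its votes accordingly; here every function counts equally.
    """
    total = sum(votes)
    if total > 0:
        return POSITIVE
    if total < 0:
        return NEGATIVE
    return ABSTAIN


posts = [
    "Bike for sale, asking $120",
    "Does anyone know a good bike shop?",
    "Is this bike worth $300?",
]
matrix = label_matrix(posts)
labels = [majority_vote(row) for row in matrix]

for post, row, label in zip(posts, matrix, labels):
    print(f"{label:+d}  votes={row}  {post}")
```

The third post is the interesting case: one function votes for a listing and another votes against, so the unweighted vote abstains. Resolving conflicts like this by learning per-function accuracies is what the generative model is for.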
Ideas (Extensions for the system):
Ideas (Applications):
https://github.com/ganesh-srinivas/laughter/
UPDATE: This project was deemed successful, and I received a very positive evaluation from my mentors! :-) (you can view it at http://ganesh-srinivas.github.io/gsoc_final_evaluation.pdf)
The main deliverables from this project are machine learning classifiers that can perform laughter detection and categorization: identify if an audio clip contains laughter or not, and categorize the laughter (giggle, baby laugh, chuckle/chortle, snicker, belly laugh).
| Model Architecture | Input Feature | Output pooling | Test set metrics |
|---|---|---|---|