Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Jupyter notebook markdown generator

Posts

Demanding Leisure in the Time of COVID

5 minute read

Published: April 05, 2021

November 2020 was the lowest time for me during the COVID-19 pandemic. My boyfriend left to spend a month with his faimly across the country, while I stayed home in our apartment. Aside from a few days visiting my sister, I spent the month without human contact.

Practicing Essentialism in the Tech Industry

8 minute read

Published: February 14, 2021

Hello readers! It’s been a long time since I posted on this blog. Years, actually. During that time, I finished my PhD at UC Berkeley and started working at Pinterest. It was an adjustment, to say the least. I traded my 10 minute walk to campus for an hourlong bus ride across the Bay Bridge; freedom to work from anywhere with Wifi for butt-in-chair from nine to five; the pursuit of knowledge for the pursuit of measurable business impact.

How I Traveled the World in Grad School

6 minute read

Published: March 16, 2019

If you follow me on social media, you might’ve seen that I’ve been traveling a ton this past year, and most of it has been related to my grad school work. In my five years as a PhD student, I’ve visited five states and five countries for conferences and other events. As someone who didn’t travel much as a kid, I’ve been loving these opportunities!

Random Problems with R

less than 1 minute read

Published: January 22, 2019

I gave a lightning talk at the SF R Ladies meet-up about a problem with R’s sampling algorithm. Check out my slides here!

Internet voting, security, and transparency at E-VOTE-ID

7 minute read

Published: October 08, 2018

Last week, I attended my first voting conference: E-VOTE-ID. I’ve presented at statistics conferences before but never an interdisciplinary one like E-VOTE-ID. It brought together people working on electronic voting issues from a whole range of disciplines: legal studies, sociology, cryptography and security, voting systems developers, former election officials, and one statistician. This guy!

From Paper to Program: Challenges of Implementing Permutation Tests

1 minute read

Published: June 12, 2018

I gave a talk about a short book I’m writing at the 4th Conference of the International Society of Nonparametric Statistics. Please check out my slides!

My grad school “crisis of faith” and how I got through it

8 minute read

Published: April 16, 2018

A few years ago, I was pretty unhappy in my PhD program.

Open Source Licenses Explained

3 minute read

Published: March 26, 2018

I ALWAYS forget to put a license on my work until someone reminds me. I’ve learned over and over that it’s important, but I think the reason why it hasn’t stuck is that I was never taught why it’s important.

Why There Isn’t More Evidence That Pesticides Disrupt Your Hormones

9 minute read

Published: December 08, 2017

I’ve really been gotten on the crunchy bandwagon this year – buying high quality grassfed meats, organic produce, paraben-free beauty products, and swapping out plastic food storage containers for glass ones. Up until recently, I was skeptical about the evidence that these choices really make a difference for your health.

Questions to Consider about Your Product from a VC Partner

4 minute read

Published: November 27, 2017

I participated in my first hackathon two weekends ago. I use code to do data analysis most of the time, not write apps or websites. For me, it was more of a fun learning experience and I got to see what kinds of work are expected and rewarded.

Why do people lose interest in academic careers during grad school?

6 minute read

Published: October 24, 2017

Fewer grad students are on the job market for faculty positions – is it because they realize that there are fewer jobs or because they are genuinely more interested in other career paths? Roach and Sauermann studied interest in academic careers in a way that has never been done before: longitudinal surveys of current graduate students. By giving people the survey twice, once in their first or second year of the PhD and then again three years later, they are able to measure changes in interest. Previously, people have only looked at cross-sectional data and compared two groups at different points in their PhD.

Uncommon Hypothesis Tests to Debunk Common Misconceptions

less than 1 minute read

Published: September 28, 2017

I gave a talk about p-values and hypothesis testing at BIDS. Please check out my slides!

How Well Did I Follow Pedagogy Guidelines at R Bootcamp 2017?

7 minute read

Published: August 21, 2017

This week I had the privilege of participating in two workshops: I was a participant at a train-the-trainer workshop to become a Software Carpentry instructor and an instructor at the R Bootcamp put on by the Statistics Department and D-Lab. It was a unique opportunity to spend two days learning how to teach one of these bootcamps, and then to put my skills to the test a few days later.

Embedding Python plotly figures in markup

2 minute read

Published: August 04, 2017

A lightweight markup language is a simple, human-readable language for formatting text. It’s easy to read and compatible with most text editors. Documents written in lightweight markup are usually then converted to things that are harder for people, but easier for computers, to read, like HTML. The most common ones that I’ve heard of people using are Markdown, R Markdown, and reStructured Text. I imagine that most people who do data analysis/exploratory visualization/data science use a markup language more often than they write in raw HTML.

Which logistic regression method in Python should I use?

6 minute read

Published: July 28, 2017

This question is related to my last blog post about what people consider when choosing which Python package to use. Say I want to use some statistical method. I have a few options. I could code it up from scratch myself, knowing that this might have undetected bugs and be pretty slow. I could Google what I’m looking for and use the first thing I find; similarly, there are no guarantees. Or, I could do my research, find all the packages that seem to offer what I’m looking for, and decide which looks best based on how thoroughly they’ve documented and tested their code.

What’s important when vetting open source packages?

6 minute read

Published: July 21, 2017

I’m in the early stages of creating several Python packages right now (shameless self plug – see permute, cryptorandom, and pscore_match). I want people to actually use them when they’re ready. They have potential for wide use, but they have narrow functionality compared to big packages like numpy or scipy. I could imagine that somebody looking to do a particular task in Python, like propensity score matching, would do a Google search and stumble upon my package.

Simple Random Sampling: Not So Simple

4 minute read

Published: February 03, 2017

This blog was originally posted on the BIDS blog, here.

Gender Issues Roundtable Discussion: A Case Study in Uncomfortable Conversations

14 minute read

Published: December 14, 2016

This blog was originally posted on the BIDS blog, here, and was written with Rebecca Barter, Ryan Giordano, and Sara Stoudt.

Bringing Data Science Back to Statistics

6 minute read

Published: February 16, 2016

This blog post originally appeared on the BIDS blog.

portfolio

Model-based Matching

Published: May 01, 2015

Is Salt Bad for Nations?

Published: October 15, 2023

Student Evaluations of Teaching (Mostly) Do Not Measure Teaching Effectiveness

Published: October 15, 2023

publications

Student Evaluations of Teaching (Mostly) Do Not Measure Teaching Effectiveness

Published: January 07, 2016

We showed that student evaluations of teaching are biased against female instructors, and it is not possible to adjust for the bias due to its dependence on many other factors.

Download here

permute: a Python Package for Permutation Methods

Published: June 28, 2016

permute is a Python package for permutation testing.

Download here

Simple Random Sampling: Not So Simple

Published: June 13, 2017

We propose several best practices for researchers using PRNGs for simulations, including the wide adoption of hash function based PRNGs.

Nonparametric Risk Attribution for Factor Models of Portfolios

Published: October 03, 2017

We explain a two-step process for partitioning the risk of projected returns into contributions from latent factors using nonparametric regression methods.

An Empirical Comparison of Parametric and Permutation Tests for Regression Analysis of Randomized Experiments

Published: October 10, 2017

We compared traditional ANCOVA to permutation approaches in the analysis of randomized experiments.

Download here

The Practice of Reproducible Research: Case Studies and Lessons from the Data-Intensive Sciences

Published: October 17, 2017

I contributed two case studies of my data science workflow.

Download here

Next Steps for the Colorado Risk-Limiting Audit (CORLA) Program

Published: March 02, 2018

We proposed a strategy for Colorado to conduct risk-limiting post-election audits of contests spanning jurisdictions that use heterogeneous voting systems.

Download here

Random Problems with R

Published: September 18, 2018

Faulty algorithms in R’s random sampling functions.

Download here

The Risk Limit of Bayesian Audits

Published: October 02, 2018

I presented this work at the E-VOTE-ID 2018 PhD Colloquium.

Download here

Risk-Limiting Audits by Stratified Union-Intersection Tests of Elections (SUITE)

Published: October 05, 2018

We’ve developed a new statistical method for risk-limiting post-election audits of stratified samples of ballots.

Download here

Bernoulli Ballot Polling: A Manifest Improvement for Risk-Limiting Audits

Published: February 22, 2019

Ballot polling RLAs using Bernoulli sampling, rather than simple random sampling, can bypass current logistical challenges.

Download here

Kellie Ottoboni

Sitemap

Pages

Posts

portfolio

publications

talks

teaching