### Abstract

We consider the problem of identifying underlying community-like structures in graphs. Toward this end, we study the stochastic block model (SBM) on k-clusters: a random model on n = km vertices, partitioned in k equal sized clusters, with edges sampled independently across clusters with probability q and within clusters with probability p, p > q. The goal is to recover the initial “hidden” partition of [n]. We study semidefinite programming (SDP)-based algorithms in this context. In the regime (formula presented), we show that a certain natural SDP-based algorithm solves the problem of exact recovery in the k-community SBM, with high probability, whenever (formula presented), as long as k= o(log n). This threshold is known to be the information theoretically optimal. We also study the case when (formula presented). In this case however, we achieve recovery guarantees that no longer match the optimal condition (formula presented), thus leaving achieving optimality for this range an open question.

Original language | English (US) |
---|---|

Title of host publication | Applied and Numerical Harmonic Analysis |

Publisher | Springer International Publishing |

Pages | 125-162 |

Number of pages | 38 |

Edition | 9783319698014 |

DOIs | |

State | Published - Jan 1 2017 |

### Publication series

Name | Applied and Numerical Harmonic Analysis |
---|---|

Number | 9783319698014 |

ISSN (Print) | 2296-5009 |

ISSN (Electronic) | 2296-5017 |

### Fingerprint

### Keywords

- Dual certificate
- Graph partitioning
- Random models
- Semidefinite programming
- Stochastic block model

### ASJC Scopus subject areas

- Applied Mathematics

### Cite this

*Applied and Numerical Harmonic Analysis*(9783319698014 ed., pp. 125-162). (Applied and Numerical Harmonic Analysis; No. 9783319698014). Springer International Publishing. https://doi.org/10.1007/978-3-319-69802-1_4

**Multisection in the stochastic block model using semidefinite programming.** / Agarwal, Naman; Bandeira, Afonso; Koiliaris, Konstantinos; Kolla, Alexandra.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

*Applied and Numerical Harmonic Analysis.*9783319698014 edn, Applied and Numerical Harmonic Analysis, no. 9783319698014, Springer International Publishing, pp. 125-162. https://doi.org/10.1007/978-3-319-69802-1_4

}

TY - CHAP

T1 - Multisection in the stochastic block model using semidefinite programming

AU - Agarwal, Naman

AU - Bandeira, Afonso

AU - Koiliaris, Konstantinos

AU - Kolla, Alexandra

PY - 2017/1/1

Y1 - 2017/1/1

N2 - We consider the problem of identifying underlying community-like structures in graphs. Toward this end, we study the stochastic block model (SBM) on k-clusters: a random model on n = km vertices, partitioned in k equal sized clusters, with edges sampled independently across clusters with probability q and within clusters with probability p, p > q. The goal is to recover the initial “hidden” partition of [n]. We study semidefinite programming (SDP)-based algorithms in this context. In the regime (formula presented), we show that a certain natural SDP-based algorithm solves the problem of exact recovery in the k-community SBM, with high probability, whenever (formula presented), as long as k= o(log n). This threshold is known to be the information theoretically optimal. We also study the case when (formula presented). In this case however, we achieve recovery guarantees that no longer match the optimal condition (formula presented), thus leaving achieving optimality for this range an open question.

AB - We consider the problem of identifying underlying community-like structures in graphs. Toward this end, we study the stochastic block model (SBM) on k-clusters: a random model on n = km vertices, partitioned in k equal sized clusters, with edges sampled independently across clusters with probability q and within clusters with probability p, p > q. The goal is to recover the initial “hidden” partition of [n]. We study semidefinite programming (SDP)-based algorithms in this context. In the regime (formula presented), we show that a certain natural SDP-based algorithm solves the problem of exact recovery in the k-community SBM, with high probability, whenever (formula presented), as long as k= o(log n). This threshold is known to be the information theoretically optimal. We also study the case when (formula presented). In this case however, we achieve recovery guarantees that no longer match the optimal condition (formula presented), thus leaving achieving optimality for this range an open question.

KW - Dual certificate

KW - Graph partitioning

KW - Random models

KW - Semidefinite programming

KW - Stochastic block model

UR - http://www.scopus.com/inward/record.url?scp=85042476724&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042476724&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-69802-1_4

DO - 10.1007/978-3-319-69802-1_4

M3 - Chapter

T3 - Applied and Numerical Harmonic Analysis

SP - 125

EP - 162

BT - Applied and Numerical Harmonic Analysis

PB - Springer International Publishing

ER -