class: center, middle, inverse, title-slide # The role of Statistics in the world of Big Data ### Salvador, 2018 --- class: class: center, middle # Data janitor <img src="imgs/jtrecenti.png" width="90%" style="display: block; margin: auto;" /> --- class: center, middle <img src="imgs/cursor.png" width="90%" style="display: block; margin: auto;" /> --- class: middle, center ### Food for thought <img src="abertura_amostra_files/figure-html/unnamed-chunk-3-1.png" width="100%" style="display: block; margin: auto;" /> --- ## New era? - Data variety increased. - Important data today: <img src="imgs/friends.jpg" width="23%" /><img src="imgs/acordao.png" width="23%" /><img src="imgs/wave.png" width="23%" /> --- ## Deep Learning - Recent popularity and hype. - Astonishing case studies. - Different terminology <img src="imgs/deepl.png" width="70%" style="display: block; margin: auto;" /> --- ## Problems - Many (too many?) people using it acritically. -- - Market is demanding. -- - We don't learn that in undergrad courses (or do we?) -- ### Are statisticians becoming obsolete? --- class: inverse, middle, center # Depression --- ## Marketing and venn diagrams <img src="imgs/diff.png" width="90%" style="display: block; margin: auto;" /> --- ## Marketing and venn diagrams <img src="imgs/dsvenn1.png" width="80%" style="display: block; margin: auto;" /> --- ## Marketing and venn diagrams <img src="imgs/dsvenn3.png" width="80%" style="display: block; margin: auto;" /> --- ## Depression <img src="imgs/quit.png" width="90%" style="display: block; margin: auto;" /> --- ## Depression 2 <img src="imgs/dead.png" width="90%" style="display: block; margin: auto;" /> --- class:inverse ## Don't panic! <img src="imgs/panic.jpg" width="70%" style="display: block; margin: auto;" /> --- ## Don't panic! - The definition of Data Science does not really matter. -- - What we should discuss is how to actually **DO** data science <img src="imgs/data-science.png" width="90%" style="display: block; margin: auto;" /> --- ## Don't panic! ### There are many false cognates. -- ### Our course IS useful. -- ### We need to adapt our mindset and develop ourselves. --- ## Logistic regression <img src="imgs/glm.png" width="90%" height="80%" style="display: block; margin: auto;" /> `$$\mathbb E(Y_i) = g^{-1}(\alpha + x_i\beta)$$` --- ## Deep learning <img src="imgs/y1.png" width="100%" style="display: block; margin: auto;" /> `$$f(x) = \sigma(wx + b)$$` -- - Coincidence? --- class: inverse, center, middle # What's my role? --- class: center, middle # we give data science direction -- In a Gradient Descent optimization, the **gradient** gives the direction, and the **learning rate** gives the size of the step. `$$\beta_{\text{new}} = \beta_{\text{old}} - \alpha\nabla_\beta(\text{loss})$$` -- In data science, statisticians' role is to be the **gradient**, and computer scientists' role is to be the **learning rate**. `$$DS_{\text{new}} = DS_{\text{old}} - cs * stat (errors)$$` --- ## Where we are <img src="imgs/img01.png" width="90%" style="display: block; margin: auto;" /> --- ## Including data science <img src="imgs/img02.png" width="90%" style="display: block; margin: auto;" /> --- class: center, middle # What we want? --- ## More registered statisticians (CONRE) <img src="imgs/img03.png" width="90%" style="display: block; margin: auto;" /> --- ## More undergrad courses, less evasion <img src="imgs/img04.png" width="90%" style="display: block; margin: auto;" /> --- ## Work with other data scientists <img src="imgs/img05.png" width="90%" style="display: block; margin: auto;" /> --- class: inverse, center, middle # What should we do? --- # What do you want? -- ## LEARN (L) - Learn many things, write papers, exercise your curiosity -- ## RESOLVE (R) - Earn money, create your startup, raise a family -- ## SHARE (S) - Share your profession, be relevant on the web, help the community --- # Focus -- ## 1. [LSR] Integrate communities -- ## 2. [SLR] Be relevant on the web -- ## 3. [LRS] Study, learn, update, use R (and python) -- ## 4. [RSL] Use what the university give to you -- ## 5. [RLS] Be relevant in the university --- # Stalk me - CONRE-3: [jtrecenti@conre3.org.br](mailto:jtrecenti@conre3.org.br) Pages: - https://curso-r.com - https://abj.org.br Slides: https://jtrecenti.github.io/slides/ufba-rt/ -- <img src="imgs/causality.jpg" width="90%" style="display: block; margin: auto;" />