Differences
The differences between two revisions of the page are shown below.
site:enseignement:master:bdle:annales [21/11/2018 22:02] – amine | site:enseignement:master:bdle:annales [20/11/2019 07:57] (current version) – amine | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Recueil d' | ====== Recueil d' | ||
===== MR and Spark part ===== | ===== MR and Spark part ===== | ||
+ | ==== November 2019 ==== | ||
+ | {{ : | ||
+ | ==== September 2018 ==== | ||
+ | {{ : | ||
==== November 2017 ==== | ==== November 2017 ==== | ||
+ | === Exercise 1 === | ||
+ | == Data == | ||
+ | <code bash> | ||
+ | nom: | ||
+ | nom: | ||
+ | nom: | ||
+ | nom: | ||
+ | </ | ||
+ | == Scala program == | ||
+ | <code scala> | ||
+ | val data = sc.textFile(path+" | ||
+ | |||
+ | case class attribut(cle: | ||
+ | |||
+ | def parseElem(in: | ||
+ | val tmp = in.split(":" | ||
+ | | ||
+ | } | ||
+ | |||
+ | def parseTuple(in: | ||
+ | |||
+ | val parsed = data.map(x=> | ||
+ | |||
+ | parsed.collect.foreach(println) | ||
+ | |||
+ | case class attribut(cle: | ||
+ | |||
+ | def parseElem(in: | ||
+ | val tmp = in.split(":" | ||
+ | | ||
+ | } | ||
+ | |||
+ | def parseTuple(in: | ||
+ | |||
+ | |||
+ | |||
+ | val parsed = data.map(x=> | ||
+ | |||
+ | /* not asked */ | ||
+ | def mergeListAttributes(lat1: | ||
+ | ... | ||
+ | } | ||
+ | |||
+ | |||
+ | val synthese = parsed.reduce(mergeListAttributes) | ||
+ | |||
+ | |||
+ | scala> parsed.collect.foreach(println) | ||
+ | List(attribut(age, | ||
+ | List(attribut(niveau, | ||
+ | List(attribut(age, | ||
+ | List(attribut(nom, | ||
+ | |||
+ | |||
+ | </ | ||
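The parsing step of Exercise 1 can be sketched in plain Scala, without Spark, to make the intended shape of the result concrete. This is only an illustration under stated assumptions: the original case class and function signatures are truncated, so the `valeur` field, the `,` delimiter between elements, and the sample lines are all hypothetical. The merge function, which the page leaves unspecified, is deliberately not reproduced.

```scala
object ParseSketch {
  // Hypothetical record shape: the original case class is cut off after `cle`.
  case class Attribut(cle: String, valeur: String)

  // Parse one "key:value" element, splitting on the first ':' only.
  def parseElem(in: String): Attribut = {
    val tmp = in.split(":", 2)
    Attribut(tmp(0).trim, if (tmp.length > 1) tmp(1).trim else "")
  }

  // Parse one line into a list of attributes.
  // Assumption: elements are separated by ','.
  def parseTuple(in: String): List[Attribut] =
    in.split(",").toList.filter(_.nonEmpty).map(parseElem)

  def main(args: Array[String]): Unit = {
    // Sample lines invented for illustration; they are not the exam data.
    val lines = List("nom:alice,age:30", "niveau:M2")
    lines.map(parseTuple).foreach(println)
  }
}
```

With an RDD of lines, the same `parseTuple` function could be passed to `map`, as the program above does with `data.map`.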
=== Exercise 2: Spark Dataset algebra === | === Exercise 2: Spark Dataset algebra === | ||
== Data == | == Data == | ||
- | < | + | < |
n1,p1,m1 | n1,p1,m1 | ||
m1,p2,n2 | m1,p2,n2 | ||
Line 15: | Line 74: | ||
m2,isa,tt3 | m2,isa,tt3 | ||
</ | </ | ||
+ | == Queries == | ||
- | < | + | < |
- | /*preparation*/ | + | //preparation |
case class Triple(sujet: | case class Triple(sujet: | ||
val triples = sc.textFile(someFile). | val triples = sc.textFile(someFile). | ||
Line 26: | Line 86: | ||
val t1 = triples.withColumnRenamed(" | val t1 = triples.withColumnRenamed(" | ||
- | |||
val t2 = triples.withColumnRenamed(" | val t2 = triples.withColumnRenamed(" | ||
- | |||
val t3 = triples.withColumnRenamed(" | val t3 = triples.withColumnRenamed(" | ||
- | |||
val res = t1.join(t2, " | val res = t1.join(t2, " | ||
- | |||
Line 53: | Line 109: | ||
join(t_types.withColumnRenamed(" | join(t_types.withColumnRenamed(" | ||
join(t_types.withColumnRenamed(" | join(t_types.withColumnRenamed(" | ||
- | </scala> | + | </code> |
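The queries of Exercise 2 rename the columns of the same triples table and join the renamed copies. The join columns are truncated on this page, so the following is only a sketch of the general idea of a self-join, written with plain Scala collections rather than the Dataset API. It assumes that the object of one triple is matched against the subject of the next; the field names `prop` and `objet` and the helper `chain` are hypothetical.

```scala
object TripleJoinSketch {
  // Field names other than `sujet` are assumptions (the original is truncated).
  case class Triple(sujet: String, prop: String, objet: String)

  // Parse one "subject,property,object" line.
  def parseTriple(line: String): Triple = {
    val f = line.split(",").map(_.trim)
    Triple(f(0), f(1), f(2))
  }

  // Self-join: pair (s, p1, x) with (x, p2, o) on the shared node x.
  def chain(triples: List[Triple]): List[(String, String, String, String, String)] =
    for {
      a <- triples
      b <- triples
      if a.objet == b.sujet
    } yield (a.sujet, a.prop, a.objet, b.prop, b.objet)

  def main(args: Array[String]): Unit = {
    // Three of the sample lines shown in the data section above.
    val triples = List("n1,p1,m1", "m1,p2,n2", "m2,isa,tt3").map(parseTriple)
    chain(triples).foreach(println)
  }
}
```

On a Dataset, the same pattern is usually expressed by renaming columns on each copy so that the shared column has one name on both sides, then joining on that name.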