Add unit tests #99

jiminhsieh · 2019-09-17T12:55:11Z

Fix part of #64.

Finish the rest of the unit tests of WordCountTest, AccumulatorsTest, and MixedDatasetSuite.

In the same time, I add comment mark to ignore coverage of the main method.

jiminhsieh · 2019-09-17T13:01:03Z

src/main/scala/com/high-performance-spark-examples/dataframe/MixedDataset.scala

@@ -89,7 +89,7 @@ class MixedDataset(sqlCtx: SQLContext) {
      Dataset[(RawPanda, CoffeeShop)] = {
    //tag::joinWith[]
    val result: Dataset[(RawPanda, CoffeeShop)] = pandas.joinWith(coffeeShops,
-      $"zip" === $"zip")
+      pandas("zip") === coffeeShops("zip"))


Below log is the reason I need to change as above:

[info] - self join *** FAILED *** [info] org.apache.spark.sql.AnalysisException: Reference 'zip' is ambiguous, could be: zip#178, zip#195.; [info] at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.resolve(LogicalPlan.scala:287)

jiminhsieh · 2019-09-17T13:01:16Z

src/main/scala/com/high-performance-spark-examples/dataframe/MixedDataset.scala

-    val result: Dataset[(RawPanda, RawPanda)] = pandas.joinWith(pandas,
-      $"zip" === $"zip")
+    val result: Dataset[(RawPanda, RawPanda)] = pandas.as("l").joinWith(pandas.as("r"),
+      $"l.zip" === $"r.zip")


codecov · 2019-09-17T13:05:38Z

Codecov Report

Merging #99 into master will increase coverage by 3.71%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master      #99      +/-   ##
==========================================
+ Coverage   44.76%   48.48%   +3.71%     
==========================================
  Files          39       33       -6     
  Lines         994      959      -35     
  Branches       35       21      -14     
==========================================
+ Hits          445      465      +20     
+ Misses        549      494      -55

Impacted Files	Coverage Δ
...rmance-spark-examples/dataframe/MixedDataset.scala	`100% <100%> (+61.11%)`	⬆️
...formance-spark-examples/native/NativeExample.scala
...-spark-examples/transformations/Accumulators.scala
...performance-spark-examples/ml/CustomPipeline.scala
...rformance-spark-examples/ml/SimpleNaiveBayes.scala
...rformance-spark-examples/dataframe/RawPandas.scala	`80% <0%> (+40%)`	⬆️
...rformance-spark-examples/wordcount/WordCount.scala	`100% <0%> (+50%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f02142b...da2ca94. Read the comment docs.

jiminhsieh added 5 commits September 17, 2019 09:00

Add rest of unit test of WordCountTest

97c066a

Add reset of unit tests of Accumlators

587eb5d

Ignore coverage of main method

08969b6

Add rest of unit test of MixedDataset

7f35545

Bump the version of sbt-scoverage to 1.6.0

2bf20c6

jiminhsieh commented Sep 17, 2019

View reviewed changes

Fix warning from scalastyle

da2ca94

holdenk merged commit b4c5daf into high-performance-spark:master Dec 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add unit tests #99

Add unit tests #99

jiminhsieh commented Sep 17, 2019

jiminhsieh Sep 17, 2019

jiminhsieh Sep 17, 2019

codecov bot commented Sep 17, 2019 •

edited

Loading

Add unit tests #99

Add unit tests #99

Conversation

jiminhsieh commented Sep 17, 2019

jiminhsieh Sep 17, 2019

Choose a reason for hiding this comment

jiminhsieh Sep 17, 2019

Choose a reason for hiding this comment

codecov bot commented Sep 17, 2019 • edited Loading

Codecov Report

codecov bot commented Sep 17, 2019 •

edited

Loading