Make it fit.

Drup · Drup · commit 45e21b732f02 · 2018-06-27T17:53:03.000+02:00
diff --git a/bench.tex b/bench.tex
@@ -83,7 +83,7 @@ \subsection{Comparing Algorithms in the \haskell Implementation}
 The resulting graph plots the time (x-axis) against the number of words (y-axis) produced so far. The slope of the graph indicates the generation speed of the plotted algorithm, high slope is correlated to high generation speed.  \cref{bench:haskell:all} contains the results for the Haskell implementations.
  
 Most algorithms generate between $1.3\cdot10^3$ and $1.4\cdot10^6$ words in the first
-second, which seems more than sufficient for testing purposes.
+second, which seems sufficient for testing purposes.
 The \textbf{refConv} implementation
 which uses symbolic segments and convolutions is consistently in the
 leading group.
diff --git a/improvements.tex b/improvements.tex
@@ -25,6 +25,7 @@ \subsection{Faster Concatenation by Convolution}
         $ zipWith (liftA2 T.append) lx rly')
       : collect ysegs rly'
 \end{lstlisting}
+\vspace{-\baselineskip}
   \caption{Concatenation with convolution}
   \label{fig:concatenation-with-convolution}
 \end{figure}
diff --git a/measure/haskell_all.gnuplot b/measure/haskell_all.gnuplot
@@ -2,7 +2,7 @@
 
 # set terminal x11 size 1500,500 font 'Deja Vu Sans Mono,14' persist
 
-set terminal pngcairo transparent size 1000,2000 rounded font 'Deja Vu Sans,19'
+set terminal pngcairo transparent size 1000,1900 rounded font 'Deja Vu Sans,19'
 set output 'haskell_all.png'
 
 # set terminal tikz standalone size 15,6 textscale 0.5
diff --git a/measure/haskell_all.png b/measure/haskell_all.png
diff --git a/measure/langs.gnuplot b/measure/langs.gnuplot
@@ -2,7 +2,7 @@
 
 # set terminal x11 size 700,700 font 'Deja Vu Sans Mono,14' persist
 
-set terminal pngcairo transparent size 2000,700 rounded font 'Deja Vu Sans,18'
+set terminal pngcairo transparent size 2000,650 rounded font 'Deja Vu Sans,18'
 set output 'langs.png'
 
 # set terminal tikz standalone size 15,6 textscale 0.5
diff --git a/measure/langs.png b/measure/langs.png
diff --git a/measure/ocaml_all.gnuplot b/measure/ocaml_all.gnuplot
@@ -2,7 +2,7 @@
 
 # set terminal x11 size 1500,500 font 'Deja Vu Sans Mono,14' persist
 
-set terminal pngcairo transparent size 1000,2000 rounded font 'Deja Vu Sans,19'
+set terminal pngcairo transparent size 1000,1900 rounded font 'Deja Vu Sans,19'
 set output 'ocaml_all.png'
 
 # set terminal tikz standalone size 15,6 textscale 0.5
diff --git a/measure/ocaml_all.png b/measure/ocaml_all.png
diff --git a/motivation.tex b/motivation.tex
@@ -241,6 +241,7 @@ \subsection{McIlroy's Approach}
   then star xt
   else T.empty : concatenate lx (star lx)
 \end{lstlisting}
+\vspace{-\baselineskip}
   \caption{McIlroy's implementation of regular operators}
   \label{fig:regular-operators-0}
 \end{figure}
@@ -298,6 +299,7 @@ \subsection{Extending McIlroy}
   where
     lsigmastar = star (map T.singleton sigma)
 \end{lstlisting}
+\vspace{-\baselineskip}
   \caption{Additional operations in McIlroy's framework}
   \label{fig:more-regular-operators}
 \end{figure}
@@ -398,6 +400,7 @@ \section{Generation by Cross Section}
     combine n i =
       liftA2 T.append (lx !! i) (ly !! (n - i))
 \end{lstlisting}
+\vspace{-\baselineskip}
       % [T.append x y 
       % | x <- lx !! i, y <- ]
   \caption{Concatenation for segment representation}
@@ -445,6 +448,7 @@ \section{Generation by Cross Section}
     combine n i =
       liftA2 T.append (lx !! i) (lstar !! (n - i))
 \end{lstlisting}
+\vspace{-\baselineskip}
     % combine n i =
     %   [T.append x y 
     %   | x <- lx !! i, y <- lstar !! (n - i)]
@@ -467,6 +471,7 @@ \section{Generation by Cross Section}
     extend lsigmai =
       [T.cons a w | a <- sigma, w <- lsigmai]
 \end{lstlisting}
+\vspace{-\baselineskip}
   \caption{Complementation for the segment representation}
   \label{fig:llo-complement}
 \end{figure}
diff --git a/ocaml.tex b/ocaml.tex
@@ -3,7 +3,7 @@ \section{\ocaml Implementation}
 
 \lstset{language=[Objective]Caml}
 
-We also implemented the complete
+We also implemented our
 language generation algorithm in \ocaml.
 % The \ocaml version only implements the ``latest'' version of the
 % algorithm with a segmented representation, fast backward lookup and convolutions
@@ -91,12 +91,12 @@ \section{\ocaml Implementation}
 \autoref{code:sigs:word} contains the signature for words.
 It provides
 the empty word (for \code{One}),
-singleton words (for \code{Atom}), and to append two words.
+singleton words (for \code{Atom}), and append.
 Neither an ordering nor a length operation is needed:
 Comparison is encapsulated in the segment
 data structure and the length of a word is the index of the segment in
 which it appears.
-
+%
 This signature is satisfied by the \ocaml \code{string}
 type (\ie arrays of bytes), arrays, lists of characters, or ropes. The
 type of individual characters is unrestricted.
@@ -106,8 +106,7 @@ \section{\ocaml Implementation}
 \autoref{code:sigs:segment} contains the signature for segments.
 % The first group of operations creates and tests for empty segments and
 % singleton segments. 
-The main requirement is to support the operations on power series as described in \autoref{sec:gener-cross-sect}.
-We also requires the set operations
+The main requirement is to support the operations on power series as described in \autoref{sec:gener-cross-sect} and the set operations
 \code{union}, \code{inter} and \code{inter}.
 %
 The product described in \autoref{eq:1} is decomposed in two parts:
@@ -119,7 +118,7 @@ \section{\ocaml Implementation}
   by invocations of \code{append}.
 \end{itemize}
 %
-Experimentation with transient data-structures require s
+Experimentation with transient data-structures requires
 an explicit \code{memoize} function that avoids recomputing segments accessed
 multiple times. 
 %
@@ -317,7 +316,7 @@ \subsection{Data Structures}
 %
 Such a memoization function incurs a linear cost on enumerations. To test
 if this operation is worthwhile we implemented two modules:
-\code{ThunkList} where memoization is the identity and \code{ThunkListMemo}
+\code{ThunkList} without memoization and \code{ThunkListMemo}
 with the implementation described above.
 
 \paragraph{Lazy Lists}
@@ -338,8 +337,8 @@ \subsection{Data Structures}
 
 As the main operations on segments are set operations, one might 
 expect a set implementation to perform well. We implemented segments as sets
-of words using \ocaml's built-in \code{Set} module. \ocaml sets are implemented
-using balanced binary trees.
+of words using \ocaml's built-in \code{Set} module which relies on
+balanced binary trees.
 The only operations not implemented by \ocaml's standard library are
 the n-way merge and the product.
 %, which can be implemented using folds and unions.
@@ -351,38 +350,29 @@ \subsection{Data Structures}
 as maps from words to values where a word belongs to its domain if there is a
 path reaching a value labeled with the characters in the word.
 Tries seem well adapted to our problem:
-\begin{itemize}[leftmargin=*]
-\item As all words in a segment have the same length, we only need values at the leaves.
-  % As no prefixes need to be represented.
-\item The \code{append} operation on tries can be implemented by
-  grafting the second trie to all the leaves of the first one.
-\end{itemize}
-
+since all words in a segment have the same length, we only need values at the leaves.
+%   % As no prefixes need to be represented.
+% \item The \code{append} operation on tries can be implemented by
+%   grafting the second trie to all the leaves of the first one.
+% \end{itemize}
+%
 Hence, we can implement tries like tries of integers \cite{Okasaki98fastmergeable}.
 For simplicity, we do not use path compression, which means
 that branches are always labeled with one character.
-A trie is either \code{Empty}, a \code{Leaf} containing a value, or a \code{Node} containing a map from characters
+A trie is either \code{Empty}, a \code{Leaf} or a \code{Node} containing a map from characters
 to its child tries.
 % As we are only interested in the paths, we consider tries
 % without values. 
-
-\begin{lstlisting}
-type trie =
-  | Empty
-  | Leaf
-  | Node of trie CharMap.t
-\end{lstlisting}
-
+%
 % The implementation of most operations follows the literature.
 The only novel operation is \code{append} which computes the product of two sets.
-As we only store values at the leaves,
-it can be implemented in a single traversal which will graft the appended trie
-\code{t0} at each leaf of \code{t}, without copies.
+It can be implemented in a single traversal which grafts the
+appended trie \code{t0} at each leaf of \code{t}, without copies.
 
 \begin{lstlisting}
+type trie = Empty | Leaf | Node of trie CharMap.t
 let rec append t t0 = match t with
-  | Empty -> Empty
-  | Leaf -> t0
+  | Empty -> Empty | Leaf -> t0
   | Node map -> 
     CharMap.map (fun t' -> append t' t0) map
 \end{lstlisting}