<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">

<article article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
 <front>
    <journal-meta>
	<journal-id journal-id-type="publisher-id">Jemr</journal-id>
      <journal-title-group>
        <journal-title>Journal of Eye Movement Research</journal-title>
      </journal-title-group>
      <issn pub-type="epub">1995-8692</issn>
	  <publisher>								
	  <publisher-name>Bern Open Publishing</publisher-name>
	  <publisher-loc>Bern, Switzerland</publisher-loc>
	</publisher>
    </journal-meta>
    <article-meta>
	<article-id pub-id-type="doi">10.16910/jemr.10.4.6</article-id> 
	  <article-categories>								
				<subj-group subj-group-type="heading">
					<subject>Research Article</subject>
				</subj-group>
		</article-categories>
      <title-group>
        <article-title>Less Users More Confidence: How AOIs Don't Affect Scanpath Trend Analysis</article-title>
      </title-group>
	   <contrib-group> 
				<contrib contrib-type="author">
					<name>
						<surname>Eraslan</surname>
						<given-names>Sukru</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">1</xref>
				</contrib>
				<contrib contrib-type="author">
					<name>
						<surname>Yesilada</surname>
						<given-names>Yeliz</given-names>
					</name>
					<xref ref-type="aff" rid="aff1">1</xref>
				</contrib>				
				<contrib contrib-type="author">
					<name>
						<surname>Harper</surname>
						<given-names>Simon</given-names>
					</name>
					<xref ref-type="aff" rid="aff2">2</xref>
				</contrib>
        <aff id="aff1">
		<institution>Middle East Technical University</institution>,  <addr-line>Northern Cyprus Campus, 99738 Kalkanl&#x131;, Gu&#x308;zelyurt, Mersin 10</addr-line>,  <country>Turkey</country>
        </aff>
		<aff id="aff2">
		<institution>School of Computer Science, University of Manchester</institution>, <addr-line>Manchester, M13 9 PL</addr-line>, <country>United Kingdom</country>
        </aff>
		</contrib-group>
     
	  <pub-date date-type="pub" publication-format="electronic"> 
		<day>22</day>  
		<month>11</month>
        <year>2017</year>
      </pub-date>
	  <pub-date date-type="collection" publication-format="electronic"> 
	  <year>2017</year>
	</pub-date>
      <volume>10</volume>
      <issue>4</issue>
	 <elocation-id>10.16910/jemr.10.4.6</elocation-id> 
	<permissions> 
	<copyright-year>2017</copyright-year>
	<copyright-holder>Eraslan, Yesilada and Harper</copyright-holder>
	<license license-type="open-access">
  <license-p>This work is licensed under a Creative Commons Attribution 4.0 International License, 
  (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">
    https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted use and redistribution provided that the original author and source are credited.</license-p>
</license>
	</permissions>
      <abstract>
        <p>User studies are typically difficult, recruiting enough users is often problematic and each experiment takes a considerable amount of time to be completed. In these studies, eye tracking is increasingly used which often increases time, therefore, the lower the number of users required for these studies the better for making these kinds of studies more practical in terms of economics and time expended. The possibility of achieving almost the same results with fewer users has already been raised. Specifically, the possibility of achieving 75% similarity to the results of 65 users with 27 users for searching tasks and 34 users for browsing tasks has been observed in scanpath trend analysis which discovers the most commonly followed path on a particular web page in terms of its visual elements or areas of interest (AOIs). Different approaches are available to segment or divide web pages into their visual elements or AOIs. In this paper, we investigate whether the possibility raised by the previous work is restricted to a particular page segmentation approach by replicating the experiments with two other segmentation approaches. The results are consistent with ~5% difference for the searching tasks and ~10% difference for the browsing tasks.</p>
      </abstract>
      <kwd-group>
        <kwd>eye tracking</kwd>
        <kwd>scanpath</kwd>
        <kwd>usability</kwd>
        <kwd>sample size</kwd>
        <kwd>region of interest</kwd>
        <kwd>areas of interest</kwd>
        <kwd>visual elements</kwd>
        <kwd>web pages</kwd>
        <kwd>segmentation</kwd>
        <kwd>trending path</kwd>
        <kwd>STA</kwd>
      </kwd-group>
    </article-meta>
  </front>	
	 
  <body>

    <sec id="S1">
      <title>Open Data</title>

      <p>The dataset used for the experiments is provided in
external online repository at
<ext-link ext-link-type="uri" xlink:href="http://iamdata.cs.manchester.ac.uk/data_files/33." xlink:show="new">http://iamdata.cs.manchester.ac.uk/data_files/33.</ext-link>
It consists of the
saved versions of the web pages used, information sheet,
consent form, questionnaire and individual scanpaths in
terms of the AOIs of the web pages. The repository also
includes all the raw data from the experiments. Besides,
the Python implementation of the Scanpath Trend
Analysis (STA) algorithm can be accessed from
<ext-link ext-link-type="uri" xlink:href="https://github.com/SukruEraslan/sta." xlink:show="new">https://github.com/SukruEraslan/sta.</ext-link></p>
    </sec>
	
    <sec id="S2">
      <title>Introduction</title>
	  
      <p>User studies play an important role in improving user
experience on the web. In these studies, eye tracking has
been widely used for assessing the quality of user
experience on web pages (
        <xref ref-type="bibr" rid="R33">1</xref>
        ). It has also been used for
investigating user interactions with web pages to provide some
directions to improve user experience (
        <xref ref-type="bibr" rid="R34 R35 R36">2-4</xref>
        ). However, as
eye tracking researchers, we are commonly having
difficulties finding users for our studies (
        <xref ref-type="bibr" rid="R37">5</xref>
        ). In an eye tracking
study, a researcher is required to allocate a separate
session for each user, and therefore the study cannot be
carried out in parallel with only one eye tracker and/or one
researcher. Hence, these studies can take a significant
amount of time to be completed. To deal with this issue, it
is important to estimate the ideal number of users to
understand and model user behaviours on the web. Even
though eye tracking is increasingly used in usability
studies to evaluate and improve usability of web pages (
        <xref ref-type="bibr" rid="R33">1</xref>
        ),
existing research does not focus on the effects of the
number of users on data analysis in this field.
Specifically, the number of users required for analysing eye
movement sequences (i.e., scanpaths), which is typically
conducted based on visual elements or areas of interest
(AOIs) of web pages, has not been studied in depth in the
literature (
        <xref ref-type="bibr" rid="R38">6</xref>
        ). Different approaches can be used to divide
or segment web pages into their visual elements or AOIs
manually or automatically (
        <xref ref-type="bibr" rid="R39">7</xref>
        ). Researchers can apply an
automated approach to discover AOIs of web pages
which typically uses the source code with some
heuristics. Researchers can also manually define their AOIs
based on their goals. Specifically, if they are interested in
certain elements on web pages (such as, advertisements),
they can directly define them as their AOIs. However,
there is no investigation of possible effects of AOIs on the
required number of users for scanpath analysis. In the rest
of the paper, we will refer to the process of identifying
AOIs or visual elements on web pages as segmentation,
thus the AOIs or elements will be referred to as segments.</p>

      <p>It has been shown that there is a possibility of
achieving almost the same results with fewer users, especially
achieving 75% similarity to the results of 65 users with 27
users for searching tasks and 34 users for browsing tasks
in scanpath trend analysis (STA) (
        <xref ref-type="bibr" rid="R38">6</xref>
        ). The STA algorithm
is designed to identify the most commonly followed path
as trending path on a particular web page in terms of the
AOIs of the page (
        <xref ref-type="bibr" rid="R40">8</xref>
        ). The trending path provided by the
STA algorithm can be used to improve the web
experience of users in constrained environments, especially
visually disabled users (
        <xref ref-type="bibr" rid="R40">8</xref>
        ). In particular, web pages can
be adapted for visually disabled users by making
commonly used areas more accessible in the desired order
such that the users can directly access these areas with
their screen readers without spending unnecessary time
on clutter (
        <xref ref-type="bibr" rid="R36">4</xref>
        ).</p>
		
      <p>To investigate the possibility of achieving almost the
same results with fewer users in scanpath trend analysis,
only the extended and improved version of the
Visionbased Page Segmentation (VIPS) approach was used by
Eraslan, Yesilada (
        <xref ref-type="bibr" rid="R38">6</xref>
        ) to automatically segment the web
pages into their AOIs (
        <xref ref-type="bibr" rid="R41">9</xref>
        ). However, the selection of a
page segmentation approach may affect the possibility of
achieving the same results with fewer users in scanpath
analysis. Therefore, in this paper, we replicate the
experiments from Eraslan, Yesilada (
        <xref ref-type="bibr" rid="R38">6</xref>
        ) with the same eye
tracking dataset by using different segmentation
approaches to investigate whether this possibility is
restricted to a particular segmentation approach.</p>

      <p>As an example, Figure 1 shows the Babylon page with
its AOIs automatically discovered by the extended and
improved version of the VIPS approach (
        <xref ref-type="bibr" rid="R41">9</xref>
        ). The VIPS
approach segments web pages based on both their source
code and visual representation, and relates their elements
to the underlying source code (
        <xref ref-type="bibr" rid="R41">9</xref>
        ). Figure 1 also illustrates
a scanpath of a particular user on the Babylon page where
the circles represent the points fixated by the user and the
largest circle shows the longest fixation (
        <xref ref-type="bibr" rid="R42">10</xref>
        ). As
illustrated in the figure, scanpath analysis allows the investigation
of which elements catch attention and which paths are
followed in terms of the elements (
        <xref ref-type="bibr" rid="R40">8</xref>
        ).</p>
		
<fig id="fig01" fig-type="figure" position="float">
					<label>Figure 1.</label>
					<caption>
						<p>A user scanpath on the Babylon web page that is segmented into its AOIs by the VIPS approach</p>
					</caption>
					<graphic id="graph01" xlink:href="jemr-10-04-f-figure-01.png"/>
				</fig>		
		
      <p>To investigate the validity of the possibility of
achieving almost the same results with fewer users with other
page segmentation approaches, we firstly reviewed the
literature to find appropriate segmentation approaches for
our study. However, we could find only a few public
segmentation approaches. This is a common problem in
computer science as highlighted by Collberg and
Proebsting (
        <xref ref-type="bibr" rid="R43">11</xref>
        ). Thus, we could replicate the experiments
with two other page segmentation approaches. As an
alternative to the VIPS approach, we used the
Block-oMatic (BOM) approach for the automatic segmentation
(
        <xref ref-type="bibr" rid="R39 R44">7, 12</xref>
        ). The BOM approach also segments web pages
based on both their source code and visual representation,
but uses an alternative hybrid algorithm to do that. As
researchers can also prefer to segment web pages
manually based on their goals (
        <xref ref-type="bibr" rid="R45">13</xref>
        ), in addition to the BOM
automatic segmentation approach (
        <xref ref-type="bibr" rid="R44">12</xref>
        ), we decided to apply
a user driven approach to manually segment the web
pages into their AOIs. For the manual segmentation, we
again used the same dataset (
        <xref ref-type="bibr" rid="R38">6</xref>
        ) because the participants
were asked to draw what they remember about the layout
of the web pages on a piece of paper. This method is
called recall. As it is explained by Johnson (
        <xref ref-type="bibr" rid="R46">14</xref>
        ), visual
cues are very important to support users to recognise
where they are. This recall approach helps us to
understand which visual elements/AOIs of the pages really
supported that. Those recalled by the participants were
more likely to support their tasks. After the participants
were asked to draw what they recall, we combined their
drawings to manually segment the pages into their
elements. We could also segment the web pages manually by
ourselves but we used the recall approach to be more
objective. If we drew the segments by ourselves, this
would reflect our particular view of the segments on the
pages which would not be objective. By using the recall
method, we discovered how the participants generally
segmented the pages in their minds and how the design of
the pages supported that. Since they drew the segments
based on what they remember about the pages, the drawn
segments would have taken their attention, thus could be
valuable in scanpath trend analysis.</p>

      <p>The rest of this paper firstly discusses the related
work, secondly explains our methodology including the
STA algorithm, the eye tracking study, the segmentation
approaches used and our analysis procedure, then presents
the results along with their discussion and finally provides
the conclusions.</p>
    </sec>
	
    <sec id="S3">
      <title>Related Work</title>
	  
      <p>Usability studies are conducted to evaluate a particular
product, such as a web page. Usability experts can
observe how users interact with a particular product to
complete certain tasks and/or they can examine the product to
investigate possible problems that can be experienced by
users (
        <xref ref-type="bibr" rid="R47">15</xref>
        ). The number of users required for usability
studies has been a debatable issue for more than 30 years
(
        <xref ref-type="bibr" rid="R47 R48">15, 16</xref>
        ). Specific numbers have already been suggested
but these numbers are typically related to the studies
which attempt to identify usability problems of a
particular product (
        <xref ref-type="bibr" rid="R38">6</xref>
        ). The most popular example is from
Nielsen and Landauer (
        <xref ref-type="bibr" rid="R49">17</xref>
        ) who suggest that it is possible
to discover 85% of usability problems of a particular
product with only five users. In contrast, Faulkner (
        <xref ref-type="bibr" rid="R50">18</xref>
        )
suggests that this is not valid for all five users. Based on
her study, 15 users are required instead of five users.
Another example is from Hwang and Salvendy (
        <xref ref-type="bibr" rid="R51">19</xref>
        ) who
analysed many published research papers since 1990 and
suggest that at least 10&#xB1;2 users are needed to discover
80% of usability problems. However, there is also a
considerable number of researchers who argue that each
study has its own features, thus the number of users
cannot be unique for all studies (
        <xref ref-type="bibr" rid="R47 R52 R53 R54">15, 20-22</xref>
        ). For instance, the
complexity of products and/or the characteristics of users
can cause differences in the required number of users (
        <xref ref-type="bibr" rid="R55 R56">23, 24</xref>
        ).</p>
		
      <p>Existing research in this field does not focus on eye
tracking studies which are commonly used to evaluate
and improve usability of web pages (
        <xref ref-type="bibr" rid="R33">1</xref>
        ). Since eye
tracking studies are usually time-consuming, it is crucial for
researchers to estimate the ideal sample size for their
studies. Pernice and Nielsen (
        <xref ref-type="bibr" rid="R57">25</xref>
		) suggest to have 39 users
for a stable heat map to highlight which parts of web
pages get more attention. However, as shown in Figure 2,
these maps do not show eye movement sequences and
they are much easier to analyse in comparison with
scanpaths. Therefore, we should not assume that this number
is also appropriate for scanpath analysis. Moreover,
Pernice and Nielsen (
        <xref ref-type="bibr" rid="R57">25</xref>
		) suggest that a qualitative study
can be conducted with six users and then their eye
movements can be watched. Although researchers may gain
valuable insights for determining which fixation features
(such as, fixation duration, fixation count, time to first
fixation, etc.) would be relevant for their further analyses
by watching eye movements of users, the details are
likely to be lost as an eye tracking dataset typically consists
of many fixations. Therefore, detailed data analysis is
crucial and becomes critical compared to qualitative
analysis. Furthermore, eye tracking data analysis also needs to
be related to certain behavioural tasks. Otherwise, the
analysis cannot be completed to understand the user
behaviour.</p>

<fig id="fig02" fig-type="figure" position="float">
					<label>Figure 2.</label>
					<caption>
						<p>A heat map on the Babylon web page</p>
					</caption>
					<graphic id="graph02" xlink:href="jemr-10-04-f-figure-02.png"/>
				</fig>

      <p>The effects of the number of users on scanpath
analysis were investigated by using a scanpath analysis
algorithm called Scanpath Trend Analysis (STA) on six web
pages segmented with the VIPS approach (
        <xref ref-type="bibr" rid="R38">6</xref>
        ). This
algorithm discovers the trending scanpath on a particular page
in terms of its AOIs (
        <xref ref-type="bibr" rid="R40">8</xref>
        ). This path is different from an
absolute path that is shared by all users. Other existing
approaches typically try to discover an absolute path (26).
However, their resulting paths tend to be very short
because of the variations caused by individual differences,
and therefore they typically have low similarities to the
user scanpaths (
        <xref ref-type="bibr" rid="R40 R58">8, 26</xref>
        ). In comparison with other
approaches, the STA algorithm provides the path with the
highest similarities to the user scanpaths, thus it would be
more helpful for behaviour analysis (
        <xref ref-type="bibr" rid="R40">8</xref>
        ). The details of the
STA algorithm is provided in the following section.</p>

      <p>The study with the STA algorithm showed that it is
possible to achieve almost the same results with a less
number of users (
        <xref ref-type="bibr" rid="R38">6</xref>
        ). Specifically, there is a possibility of
achieving 75% similarity to the results of 65 users with 27
users for searching tasks and 34 users for browsing tasks.
However, the study did not show whether this possibility
is valid with other page segmentation approaches. Thus,
we ask whether it matters how we segment a web page.</p>
    </sec>
	
    <sec id="S4">
      <title>Methodology</title>
	  
      <p>This section firstly gives a brief information about the
STA algorithm, secondly explains our eye tracking study,
and then describes the segmentation approaches used and
our analysis procedure.</p>

      <sec id="S4a">
        <title>STA: Scanpath Trend Analysis</title>
		
        <p>The STA algorithm has the following three stages: (1) 
Preliminary Stage, (2) First Pass, and (3) Second Pass.
These stages are briefly described below. The full
description of the STA algorithm can be found in (
          <xref ref-type="bibr" rid="R40">8</xref>
          ).</p>
		  
        <p><bold>Preliminary Stage.</bold> The algorithm initially takes a
series of fixations for each user on a web page and the AOIs
of the page. It then finds the corresponding AOI for each
fixation to represent the user scanpaths in terms of the
AOIs. For example, the scanpath in Figure 1 is
represented as MPPQ as the user fixated M, P, again P and Q
respectively. The durations of these fixations are also
stored.</p>

        <p><bold>First Pass.</bold> The aim of the first pass is to analyse the
user scanpaths for the discovery of the trending AOIs
based on the total number of fixations and the total
duration of fixations (dwell time) on the AOIs. A user can
fixate the same AOI more than once consecutively (such
as, M<bold>PP</bold>Q) and/or non-consecutively (such as, M<bold>P</bold>Q<bold>P</bold>).
We refer to each non-consecutive visit as an instance. For
example, there are two instances of P in M<bold>P</bold>Q<bold>P</bold>. As the
STA algorithm performs sequential analysis, these
instances should be differentiated. To differentiate the
instances of the same AOI, different numbers are assigned
to them where the longest instance gets the first number.
For example, if a user fixates M, M, P, M and M for 100
ms, 200 ms, 200 ms, 400 ms and 200 ms respectively,
his/her scanpath is represented as M2 [100 ms] M2 [200
ms] P1 [200 ms] M1 [400 ms] M1 [200 ms] because the
dwell time of the first instance of M (100 ms + 200 ms) is
less than the dwell time of its second instance (400 ms +
200 ms).</p>

        <p>When the algorithm differentiates the instances, it
starts the discovery of the trending instances. When the
total number of fixations on a particular instance is
greater or equal to the minimum total number of fixations on
the fully shared instances of the user scanpaths, and the
dwell time on the instance is greater or equal to the
minimum dwell time on the fully shared instances, the
instance is defined as a trending instance. After the
discovery of the trending instances, the algorithm removes other
instances from the user scanpaths as they will not be in
the trending path.</p>

        <p><bold>Second Pass.</bold> In the second pass, the user scanpaths
are firstly collapsed by merging the same instances to
determine the exact positions of the instances in the
scanpaths because the second pass computes the sequential
priority value for each instance in each scanpath by using
the positions. For example, when the following scanpath
is available M1 [150 ms] P1 [125 ms] P1 [150 ms] Q1
[125 ms], the positions of M1, P1 and Q1 should be zero,
one and two respectively (the starting position is zero).
When the algorithm merges the same instances, it
combines them into one instance where its duration is equal to
the total duration of the instances (such as, M1 [150 ms]
P1 [125 ms] P1 [150 ms] Q1 [125 ms] &#x2192; M1 [150 ms]
P1 [275 ms] Q1 [125 ms]). The total number of fixations
on each instance (i.e., the number of occurrence for each
instance) is also stored. After that, the sequential priority
value (&#x3C8;) of each instance in each user scanpath is
computed with Equation (1) where P represents the instance
position in the scanpath and L represents the length of the
user scanpath. 1 and 0.1 are given the max and min values
respectively to give 1 point the first instance and 0.1
points to the last instance.</p>

<fig id="eq01" fig-type="equation" position="anchor">
                     <label>(1)</label>
					<graphic id="equation01" xlink:href="jemr-10-04-f-equation-01.png"/>
				</fig>
								
        <p>The total priority value (&#x3A8;) for each instance is then
computed with Equation (2) where n represents the
number of user scanpaths. In particular, if there are five
different instances in total (such as, M1, M2, P1, P2 and
Q1), then the total priority value is calculated for each of
them. The total priority values of the instances are then
used by the algorithm to locate the instances in the
trending path in descending order, thus the overall positions of
the instances in the user scanpaths are preserved. If more
than one instance share the same total priority value, their
dwell time and the total number of fixations on the
instances are also taken into consideration. In the end, the
algorithm removes the numbers of the instances (such as,
M1 &#x2192; M), and then deletes the consecutive repetitions
(such as, MPPQ &#x2192; MPQ) to represent the trending
scanpath in terms of the AOIs.</p>

<fig id="eq02" fig-type="equation" position="anchor">
                     <label>(2)</label>
					<graphic id="equation02" xlink:href="jemr-10-04-f-equation-03.png"/>
				</fig>

        <p>As an example, Figure 3 illustrates the trending
scanpath on the Babylon page which was constructed with the
STA algorithm by using the scanpaths in Figure 4. The 
trending scanpath shows us which AOIs are most
commonly used and in which order. If we remove the unused
AOIs and re-organise the remaining ones based on the
trending scanpath, we would provide faster download of
the page and direct access to the commonly used AOIs
based on the desired order.</p>

<fig id="fig03" fig-type="figure" position="float">
					<label>Figure 3.</label>
					<caption>
						<p>The trending scanpath on the Babylon web page</p>
					</caption>
					<graphic id="graph03" xlink:href="jemr-10-04-f-figure-03.png"/>
				</fig>
				
<fig id="fig04" fig-type="figure" position="float">
					<label>Figure 4.</label>
					<caption>
						<p>Multiple scanpaths on the Babylon web page</p>
					</caption>
					<graphic id="graph04" xlink:href="jemr-10-04-f-figure-04.png"/>
				</fig>				
      </sec>
	  
      <sec id="S4b">
        <title>Eye Tracking Study</title>
		
        <p>To investigate whether the possibility of achieving
almost the same results with fewer users in scanpath trend
analysis (
          <xref ref-type="bibr" rid="R38">6</xref>
          ) is restricted to a particular segmentation
approach, we used the same dataset which is from an eye
tracking study. This study is briefly described below and
its full description can be found in (
          <xref ref-type="bibr" rid="R38 R40">6, 8</xref>
          ).</p>
		  
        <p><bold>Equipment.</bold> Tobii T60 eye tracker was used to record
the eye movements of the users. It was built-in a 17''
monitor and the screen resolution of the monitor was
adjusted to 1280 x 1024.</p>

        <p><bold>Pages.</bold> The web pages used in the eye tracking study
were randomly chosen from a group of pages used by
Akp&#x131;nar and Ye&#x15F;ilada (
          <xref ref-type="bibr" rid="R41">9</xref>
          ). In their study, they investigated
the visual complexities of the home pages of the top 100
websites listed by Alexa.com by using the ViCRAM tool
(
          <xref ref-type="bibr" rid="R59">27</xref>
		  ), and then created three groups of pages based on
their complexity (low, medium and high) where each
group contains 10 randomly chosen pages. For this eye
tracking study, two pages were randomly selected from
each group: Apple &#x26; Babylon from the low complexity
group, AVG &#x26; Yahoo from the medium complexity
group, Godaddy &#x26; BBC from the high complexity group
(see Appendix).</p>

        <p>Some of the web pages contain much more text
compared to others. For example, the Yahoo page contains
much more text (353 words) compared to the Apple page
(86 words). To quantitatively illustrate the text density of
the pages, we referred to the text density function in
Equation (3) given by Spool, Scanlon (
          <xref ref-type="bibr" rid="R60">28</xref>
		  ). Since the
pages were shown in a fixed screen, we only report the
total number of words seen by the participants on each
page in Table 1.</p>

<fig id="eq03" fig-type="equation" position="anchor">
                     <label>(3)</label>
					<graphic id="equation03" xlink:href="jemr-10-04-f-equation-03.png"/>
				</fig>
				
<table-wrap id="t01" position="float">
					<label>Table 1.</label>
					<caption>
						<p>The total number of words on a page</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Page</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>The total number of words on a page</bold>
            </td>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Apple</td>
            <td rowspan="1" colspan="1">86</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Babylon</td>
            <td rowspan="1" colspan="1">156</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">AVG</td>
            <td rowspan="1" colspan="1">162</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Yahoo</td>
            <td rowspan="1" colspan="1">353</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Godaddy</td>
            <td rowspan="1" colspan="1">163</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">BBC</td>
            <td rowspan="1" colspan="1">300</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>				

        <p><bold>User Tasks.</bold> The participants were requested to
perform two different kinds of tasks on the web pages called
browsing and searching tasks. For the browsing tasks,
there was no specific objective, so the participants freely
browsed on the pages. For the searching tasks, the
participants needed to locate some specific information or items
on the web pages. The searching tasks used in the eye
tracking study are provided in Table 2.</p>

<table-wrap id="t02" position="float">
					<label>Table 2.</label>
					<caption>
						<p>The searching tasks used in the eye tracking study</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Page</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Tasks</bold>
            </td>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Apple</td>
            <td rowspan="1" colspan="1">(a) Can you locate a link which allows watching the TV ads relating to iPad mini?<break/>(b) Can you locate a link labelled iPad on the main menu?</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Babylon</td>
            <td rowspan="1" colspan="1">(a) Can you locate a link that you can download the free version of Babylon?<break/>(b) Can you find and read the names of other products of Babylon?</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">AVG</td>
            <td rowspan="1" colspan="1">(a) Can you locate a link which you can download a free trial of AVG Internet Security 2013?<break/>(b) Can you locate a link which allows you to download AVG Anti-virus Free 2013?</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Yahoo</td>
            <td rowspan="1" colspan="1">(a) Can you read the titles of the main headlines which have smaller images?<break/>(b) Can you read the first item under the News title?</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Godaddy</td>
            <td rowspan="1" colspan="1">(a) Can you find a telephone number for technical support and read it?<break/>(b) Can you locate a text box where you can search for a new domain?</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">BBC</td>
            <td rowspan="1" colspan="1">(a) Can you read the first item of the Sport News?<break/>(b) Can you locate the table that shows market data under the Business title?</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>

        <p><bold>Procedure.</bold> The study was performed in a quiet room.
Before the participants viewed the web pages, they read
an information sheet about the study and singed a consent
form. Their gender, age-group and education level were
then asked. The participants were also asked to rank the
web pages based on their usage (Daily, Weekly, Monthly,
Less than once a month, Never). After that, they viewed
the web pages twice in a random order with
counterbalancing for the browsing (30 seconds) and searching tasks
(max 120 seconds). For example, one participant
completed the tasks in the following order: &#x201C;Babylon - Search,
AVG - Search, Apple - Browse, Godaddy - Search Yahoo
- Browse, Babylon - Browse, Apple - Search, Yahoo
Search, Godaddy - Browse, AVG - Browse, BBC
Browse and BBC &#x2013; Search&#x201D; and the other one completed
in this order: &#x201C;BBC - Search, Babylon - Browse, BBC
Browse, Babylon - Search, AVG - Search, Godaddy
Browse, AVG - Browse, Apple - Search, Yahoo
Browse, Yahoo - Search, Apple - Browse, Godaddy
Search'&#x201D;. The participants could be familiar with the
pages during their first visits and this situation could affect
how they interact with the pages during their second
visits. Hence, for each web page, one half of the participants
firstly completed the searching task and then the browsing
task whereas another half firstly completed the browsing
task and then the searching task. The tasks were read by
the researcher to the participants when the relevant pages
were shown on the screen during the eye tracking
sessions. The participants were not allowed to use a
keyboard and a mouse as the tasks could be completed by
only scanning the pages. When they completed their eye
tracking sessions, they were asked to draw what they
remember on the pages. Apart from the fixations made by
the participants after completing the searching tasks, none
of the fixations were excluded from the analysis.</p>

        <p><bold>Participants.</bold> The eye tracking study conducted with
81 users at two different universities with 40 female and
41 male users. Most of these users were students from the
universities and they were mainly between the ages of
1834. The pages used in this study were not regularly visited
by the users. The percentage of the users who reported
that they never visited the pages or visited them less than
once a month were as follows: Apple: 71.6%, Babylon:
93.8%, AVG: 92.6%, Yahoo: 61.7%, Godaddy: 98.8%
and BBC: 46.9%.</p>
      </sec>
	  
      <sec id="S4c">
        <title>Page Segmentation Approaches</title>
		
        <p>Web pages can be segmented automatically or
manually based on the goals of studies (
          <xref ref-type="bibr" rid="R39">7</xref>
          ). In the previous
study to investigate the possibility of achieving almost the
same results with less users in scanpath trend analysis, the
experiments were conducted with the extended and
improved version of the VIPS approach to automatically
segment the pages (
          <xref ref-type="bibr" rid="R38 R41">6, 9</xref>
          ). In this current paper, we used
the BOM approach for the automatic segmentation of the
web pages (
          <xref ref-type="bibr" rid="R44">12</xref>
          ). Similar to the VIPS approach, the BOM
approach also uses the DOM structure and the visual
representation of web pages but it computationally
applies different algorithms. Specifically, in the BOM
approach, the segments are also grouped based on the four
Gestalt laws (Proximity, Similarity, Closure and
Simplicity). Both of the segmentation approaches have also a
granularity level parameter which affects the size of
segments and the number of segments. In our experiments,
we used the BOM approach with its default value (0.3)
for this parameter.</p>

        <p>The current publicly available implementation of the
BOM approach (
<ext-link ext-link-type="uri" xlink:href="http://www-poleia.lip6.fr/~sanojaa/BOM/" xlink:show="new">http://www-poleia.lip6.fr/~sanojaa/BOM/</ext-link>
)could not properly segment some of the web pages. Thus,
we followed a systematic approach to fix the problems
(
          <xref ref-type="bibr" rid="R61">29</xref>
		  ) by considering the algorithm given in their papers
(
          <xref ref-type="bibr" rid="R44 R62">12, 30</xref>
          ): (1) If an element covered only space, the
element was ignored. (2) If a larger element covered other
smaller elements and not all parts of this larger element
were covered by the smaller elements, the larger element
was used. This was to ensure that all parts of the larger
element are covered by an element. (3) If there were
overlaps between multiple elements, the borders of the
elements were adjusted to deal with the overlaps. (4) If there
was an element which was not located in any element as a
result of the segmentation, it was located in the nearest
element. As an example, Figure 5 shows how the Babylon
page segmented by the current publicly available
implementation of the BOM approach. The menu item for the
language selection was not covered by any block, and
therefore it was located in its nearest block L13 (The
menu item &#x201C;Store&#x201D;). Figure 6 shows how the Babylon
page was divided into their elements by using the BOM
approach with a systematic approach for fixing the
problems.</p>

<fig id="fig05" fig-type="figure" position="float">
					<label>Figure 5.</label>
					<caption>
						<p>The Babylon page segmented by the current publicly available implementation of the BOM approach</p>
					</caption>
					<graphic id="graph05" xlink:href="jemr-10-04-f-figure-05.png"/>
				</fig>
				
<fig id="fig06" fig-type="figure" position="float">
					<label>Figure 6.</label>
					<caption>
						<p>The Babylon page segmented by the BOM segmentation approach [The segmentation problems highlighted in Figure 5 are addressed in this version]</p>
					</caption>
					<graphic id="graph06" xlink:href="jemr-10-04-f-figure-06.png"/>
				</fig>				

        <p>We also followed a systematic way to find other
automatic page segmentation approaches with a public
implementation by looking at the relevant papers and
searching from the web. As illustrated in Table 3, other
segmentation approaches do not have their public
implementations and/or they are not applicable (NA) for this
study. If we could not find the implementation of the
proposed approach, we sent an email to the author(s) to
ask whether they could provide us the implementation of
their proposed segmentation approach. If the authors did
not respond to our email, we assume that their
segmentation approaches do not have a public implementation.
Moreover, not all segmentation approaches are suitable
for our study. In particular, if a segmentation approach
focusses on getting specific parts of web pages, it is
considered as not applicable (NA) for our study as we are not
interested in only specific parts of web pages. For
example, one approach focusses on getting the main content of
a web page instead of segmenting the web page into its
visual elements (
          <xref ref-type="bibr" rid="R63">31</xref>
		  ). There are also some approaches
which are outdated such as the approach of
MilicFrayling and Sommerer (
          <xref ref-type="bibr" rid="R64">32</xref>
		  ) which focusses on HTML
table tags. These approaches are also classified as NA.
Although the page segmentation approach of
Michailidou, Harper (
          <xref ref-type="bibr" rid="R65">33</xref>
		  ) is publicly available, it has
some bugs and does not generate a proper segmentation.</p>

<table-wrap id="t03" position="float">
					<label>Table 3.</label>
					<caption>
						<p>The public availability of the page segmentation approaches [+: Available, -: Not Available, NA: Not Applicable] - The references are in (
          <xref ref-type="bibr" rid="R39">7</xref>)</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Category</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Source</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Availability</bold>
            </td>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Clustering</td>
            <td rowspan="1" colspan="1">Alcic and Conrad (2011)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Custom</td>
            <td rowspan="1" colspan="1">Cheng and Gotz (2009)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Yang and Shi (2009) &#x26; Xiang et al. (2007)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Guo et al. (2007)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Gupta et al. (2007)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Hattori et al. (2007)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Gu et al. (2002)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Chen et al. (2001)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Sanoja and Gan&#xE7;arski (<xref ref-type="bibr" rid="R34">12</xref>) &#x26; Sanoja and Gan&#xE7;arski (<xref ref-type="bibr" rid="R62">30</xref>)- BOM</td>
            <td rowspan="1" colspan="1">+</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Wei, Lu (<xref ref-type="bibr" rid="R66">34</xref>)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Zeleney, Burget (<xref ref-type="bibr" rid="R67">35</xref>)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">DOM-Based</td>
            <td rowspan="1" colspan="1">Fauzi et al. (2009)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Vineel (2009)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Xiao et al. (2008)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Chen et al. (2005, 2003)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Yin and Lee (2005)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Liu et al. (2004)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Lin and Ho (2002)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Heuristics</td>
            <td rowspan="1" colspan="1">Toh and Hong (<xref ref-type="bibr" rid="R68">36</xref>)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Burget and Rudolfova (2009)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Burget (2007)</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Ahmadi and Kong (2008)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Michailidou, Harper (<xref ref-type="bibr" rid="R65">33</xref>)</td>
            <td rowspan="1" colspan="1">+</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Kreuzer, Hage (<xref ref-type="bibr" rid="R69">37</xref>)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Debnath et al. (2005)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Kovacevic et al. (2002)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Milic-Frayling and Sommerer (<xref ref-type="bibr" rid="R64">32</xref>)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Image Processing Algorithms</td>
            <td rowspan="1" colspan="1">Cao et al. (2010)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Machine Learning</td>
            <td rowspan="1" colspan="1">Chakrabarti et al. (2008)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Borodin et al. (2007)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Mahmud et al. (2007)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Baluja (2006)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Bing, Guo (<xref ref-type="bibr" rid="R70">38</xref>)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Feng, Zhang (<xref ref-type="bibr" rid="R71">39</xref>)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Pattern Matching</td>
            <td rowspan="1" colspan="1">Xiang and Shi (2006)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Nanno et al. (2004)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Cuzzolaa, Jovanović (<xref ref-type="bibr" rid="R72">40</xref>)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Whang et al. (2001) &#x26; Hwang et al. (2003)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Ranking-Based</td>
            <td rowspan="1" colspan="1">Yin and Lee (2004)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Text-Based</td>
            <td rowspan="1" colspan="1">Sun et al. (2011)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Kohlsch&#xFC;tter, Fankhauser (<xref ref-type="bibr" rid="R63">31</xref>)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Kohlsch&#xFC;tter (2009)</td>
            <td rowspan="1" colspan="1">NA</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">Kohlsch&#xFC;tter and Nejdl (2008)</td>
            <td rowspan="1" colspan="1">-</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>

        <p>We also manually segmented the web pages as
manual segmentation can also be preferred by researchers
based on their goals (
          <xref ref-type="bibr" rid="R45">13</xref>
          ). In order to be more objective,
we did not want to do the segmentation by ourselves, and
therefore we used the recall approach. At the end of the
eye tracking study described above, the participants were
asked to draw what they remember about the layouts of
the web pages on a piece of paper after they completed
their tasks on the web pages. Figure 7 and Figure 8 shows
how the Babylon web page was drawn by two
participants. We systematically combined their drawings to
identify the AOIs of the web pages. To be more objective,
three researchers firstly combined the drawings for each
web page, including other researchers who are not authors
of this paper. For example, they noted how many users
divided the main menu into its elements and how many of
users drew the main menu as an entire element. Once they
combined the drawings, their notes were analysed to see
all the elements drawn by the users. If there was a larger
element that covers other smaller elements, the smaller
elements were used as they were more specific. For
example, if a menu was divided into their items, the menu
items were considered as separate elements instead of
using the menu as one element. Even though we applied
the same approach here as explained for the BOM
approach (
          <xref ref-type="bibr" rid="R44">12</xref>
          ) (If a larger element covered other smaller
elements and not all parts of this larger element were
covered by the smaller elements, the larger element was
used), we used the smaller elements since all parts of the
larger element were covered by the smaller elements here.
As an example, Figure 9 shows how the Babylon page is
segmented by using this approach.</p>

<fig id="fig07" fig-type="figure" position="float">
					<label>Figure 7.</label>
					<caption>
						<p>The Babylon page drawn by the participant 1</p>
					</caption>
					<graphic id="graph07" xlink:href="jemr-10-04-f-figure-07.png"/>
				</fig>	
				
<fig id="fig08" fig-type="figure" position="float">
					<label>Figure 8.</label>
					<caption>
						<p>The Babylon page drawn by the participant 2</p>
					</caption>
					<graphic id="graph08" xlink:href="jemr-10-04-f-figure-08.png"/>
				</fig>

<fig id="fig09" fig-type="figure" position="float">
					<label>Figure 9.</label>
					<caption>
						<p>The Babylon page segmented by the user driven segmentation approach</p>
					</caption>
					<graphic id="graph09" xlink:href="jemr-10-04-f-figure-09.png"/>
				</fig>				

        <p>The overview of the differences between the
segmentations of the VIPS, BOM and user driven approaches in
terms of the number of AOIs generated and their
coverage on the web pages can be seen from Table 4. Some
parts of a particular web page may be located in separate
AOIs with some segmentation approaches whereas the
same parts may be located in only one AOI with other
approaches. For example, each of the other Babylon
products (&#x201C;Full Text Translation&#x201D;, &#x201C;Human Translation&#x201D;,
&#x201C;Learn a Language&#x201D; and &#x201C;Corporate Customers&#x201D;) was
located in a separate AOI with the VIPS (with its most
satisfactory granularity (
          <xref ref-type="bibr" rid="R41">9</xref>
          )) and user driven segmentation
approaches (see Figure 1 and Figure 9) however all these
products are located in the same AOI with the BOM
segmentation approach (with its default granularity
parameter) (see Figure 6). As illustrated in Table 4, the VIPS
approach provides more and smaller segments on the
Babylon, AVG, and BBC web pages in comparison with
the BOM and user driven approaches. In contrast, it
provides fewer and larger segments on the Yahoo page.</p>

<table-wrap id="t04" position="float">
					<label>Table 4.</label>
					<caption>
						<p>The % of the coverage of the AOIs generated by the VIPS, BOM and user driven approaches [N: The number of AOIs, M: Mean, MD: Median, SD: Standard Deviation]</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Page</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Approach</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>N</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>M</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>MD</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>SD</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Max</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Min</bold>
            </td>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Apple</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">18</td>
            <td rowspan="1" colspan="1">2.68</td>
            <td rowspan="1" colspan="1">0.82</td>
            <td rowspan="1" colspan="1">5.46</td>
            <td rowspan="1" colspan="1">23.57</td>
            <td rowspan="1" colspan="1">0.06</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">7</td>
            <td rowspan="1" colspan="1">7.54</td>
            <td rowspan="1" colspan="1">3.38</td>
            <td rowspan="1" colspan="1">9.57</td>
            <td rowspan="1" colspan="1">26.58</td>
            <td rowspan="1" colspan="1">0.77</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">18</td>
            <td rowspan="1" colspan="1">2.58</td>
            <td rowspan="1" colspan="1">0.76</td>
            <td rowspan="1" colspan="1">4.84</td>
            <td rowspan="1" colspan="1">21.03</td>
            <td rowspan="1" colspan="1">0.30</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Babylon</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">22</td>
            <td rowspan="1" colspan="1">1.98</td>
            <td rowspan="1" colspan="1">1.60</td>
            <td rowspan="1" colspan="1">1.65</td>
            <td rowspan="1" colspan="1">5.27</td>
            <td rowspan="1" colspan="1">0.24</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">13</td>
            <td rowspan="1" colspan="1">2.79</td>
            <td rowspan="1" colspan="1">1.05</td>
            <td rowspan="1" colspan="1">4.39</td>
            <td rowspan="1" colspan="1">16.09</td>
            <td rowspan="1" colspan="1">0.06</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">17</td>
            <td rowspan="1" colspan="1">2.62</td>
            <td rowspan="1" colspan="1">0.56</td>
            <td rowspan="1" colspan="1">3.98</td>
            <td rowspan="1" colspan="1">15.58</td>
            <td rowspan="1" colspan="1">0.14</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">AVG</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">25</td>
            <td rowspan="1" colspan="1">1.97</td>
            <td rowspan="1" colspan="1">0.45</td>
            <td rowspan="1" colspan="1">4.65</td>
            <td rowspan="1" colspan="1">20.93</td>
            <td rowspan="1" colspan="1">0.07</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">7</td>
            <td rowspan="1" colspan="1">6.99</td>
            <td rowspan="1" colspan="1">1.82</td>
            <td rowspan="1" colspan="1">8.16</td>
            <td rowspan="1" colspan="1">21.05</td>
            <td rowspan="1" colspan="1">0.32</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">20</td>
            <td rowspan="1" colspan="1">2.41</td>
            <td rowspan="1" colspan="1">2.47</td>
            <td rowspan="1" colspan="1">1.90</td>
            <td rowspan="1" colspan="1">8.35</td>
            <td rowspan="1" colspan="1">0.44</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Yahoo</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">10</td>
            <td rowspan="1" colspan="1">6.93</td>
            <td rowspan="1" colspan="1">1.94</td>
            <td rowspan="1" colspan="1">9.57</td>
            <td rowspan="1" colspan="1">25.78</td>
            <td rowspan="1" colspan="1">0.19</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">12</td>
            <td rowspan="1" colspan="1">5.09</td>
            <td rowspan="1" colspan="1">1.56</td>
            <td rowspan="1" colspan="1">7.13</td>
            <td rowspan="1" colspan="1">24.78</td>
            <td rowspan="1" colspan="1">0.08</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">24</td>
            <td rowspan="1" colspan="1">2.57</td>
            <td rowspan="1" colspan="1">0.92</td>
            <td rowspan="1" colspan="1">2.88</td>
            <td rowspan="1" colspan="1">9.14</td>
            <td rowspan="1" colspan="1">0.10</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Godaddy</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">16</td>
            <td rowspan="1" colspan="1">4.16</td>
            <td rowspan="1" colspan="1">0.52</td>
            <td rowspan="1" colspan="1">9.44</td>
            <td rowspan="1" colspan="1">35.15</td>
            <td rowspan="1" colspan="1">0.10</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">11</td>
            <td rowspan="1" colspan="1">5.64</td>
            <td rowspan="1" colspan="1">1.46</td>
            <td rowspan="1" colspan="1">7.21</td>
            <td rowspan="1" colspan="1">18.42</td>
            <td rowspan="1" colspan="1">0.11</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">19</td>
            <td rowspan="1" colspan="1">2.94</td>
            <td rowspan="1" colspan="1">1.18</td>
            <td rowspan="1" colspan="1">4.36</td>
            <td rowspan="1" colspan="1">17.61</td>
            <td rowspan="1" colspan="1">0.35</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">BBC</td>
            <td rowspan="1" colspan="1">VIPS</td>
            <td rowspan="1" colspan="1">21</td>
            <td rowspan="1" colspan="1">3.22</td>
            <td rowspan="1" colspan="1">0.95</td>
            <td rowspan="1" colspan="1">4.26</td>
            <td rowspan="1" colspan="1">12.90</td>
            <td rowspan="1" colspan="1">0.15</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM</td>
            <td rowspan="1" colspan="1">5</td>
            <td rowspan="1" colspan="1">14.43</td>
            <td rowspan="1" colspan="1">8.79</td>
            <td rowspan="1" colspan="1">16.28</td>
            <td rowspan="1" colspan="1">43.22</td>
            <td rowspan="1" colspan="1">4.24</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">User driven</td>
            <td rowspan="1" colspan="1">10</td>
            <td rowspan="1" colspan="1">6.72</td>
            <td rowspan="1" colspan="1">6.62</td>
            <td rowspan="1" colspan="1">3.85</td>
            <td rowspan="1" colspan="1">14.45</td>
            <td rowspan="1" colspan="1">0.17</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>

        <p>The segmentation of the pages with each of these
segmentation approaches is visualised and provided in
our external repository. Once the web pages were
segmented by using both the BOM segmentation approach
(
          <xref ref-type="bibr" rid="R44">12</xref>
          ) and the user driven approach, we replicated the
experiments of our previous study (
          <xref ref-type="bibr" rid="R38">6</xref>
          ) by using the same
methodology as described in the following section.</p>
      </sec>
	  
      <sec id="S4d">
        <title>Analysis Procedure</title>
		
        <p>As some of the users could not successfully complete
their tasks on some of the web pages and/or experienced
some eye calibration problems, they were classified as
unsuccessful users and excluded from the experiments. If
a particular user was unsuccessful on the page X but
successful on the page Y, s/he was excluded from only the
page X. Due to this reason, the number of users on the
web pages for the browsing and searching tasks were not
the same. However, it would be better to have the same
number of users on all the pages for both types of the
tasks for the consistency among the pages and estimation
of the overall graph. Thus, some of the successful users
were also randomly excluded from the experiments. As a
result, the experiments were conducted with 65 users on
each web page for the browsing and searching tasks.</p>

        <p>The STA algorithm was then applied to the users to
discover their trending scanpaths on the pages which were
segmented by using the BOM approach (
          <xref ref-type="bibr" rid="R44">12</xref>
          ) and the user
driven approach. Following this, smaller groups of users
were chosen with the size of one user to 64 users from the
successful users and then the STA algorithm was applied
to these groups to discover their trending scanpaths. In
other words, sub-groups were randomly created with
different sizes from 1 user to 64 users and their trending
scanpath were identified with the STA algorithm (i.e.,
group of one user, group of two users, group of three
users &#x2026; group of 64 users). The results of the smaller
groups were then compared with the results of the entire
group. To deal with any possible effects of the selected
users, 100 different combinations were generated for each
group size and their median similarity was taken as a
similarity to the entire the group to counteract extreme
cases (
          <xref ref-type="bibr" rid="R50">18</xref>
          ).</p>
		  
        <p>To compare two scanpaths, the Levenshtein Distance
(String-edit) algorithm was used as it has widely been
used to determine the distance between two scanpaths
(
          <xref ref-type="bibr" rid="R73 R74">41, 42</xref>
          ). This algorithm transforms one scanpath to
another by using the minimum number of addition, deletion
and substitution operations. The minimum number of
operations shows the distance between the two scanpaths.
For instance, the distance between DEFG and DEHG is
equal to one because it is sufficient to substitute F with H
to transform one of them to another or vice versa.</p>

        <p>The sizes of AOIs and the distances between the AOIs
are potentially different, as a consequence the substitution
costs between all pairs of AOIs may not be the same (
          <xref ref-type="bibr" rid="R73">41</xref>
          ).
A substitution matrix has been suggested to take this issue
into account (
          <xref ref-type="bibr" rid="R74">42</xref>
          ). This matrix should include substitution
costs between all pairs of AOIs. As suggested by
Takeuchi and Habuchi (
          <xref ref-type="bibr" rid="R74">42</xref>
          ), the Euclidean distances
between the AOIs were used to generate a substitution
matrix in the experiments. Equation (4) below illustrates
how to calculate a substitution cost between two AOIs <italic>I</italic>
and <italic>J</italic> based on the Euclidean distance where <italic>I&#x2081;</italic> and <italic>I&#x2082;</italic> are
x and y coordinates of the centre of the element <italic>I</italic> and &#x3B1; is
a type of normalisation parameter (
          <xref ref-type="bibr" rid="R74">42</xref>
          ). The normalisation
parameter was taken as 0.001 (
          <xref ref-type="bibr" rid="R74">42</xref>
          ). The matrix was then
integrated into the String-edit algorithm to determine the
distance between two scanpaths by computing the
minimum cost for transforming one scanpath to another.</p>

<fig id="eq04" fig-type="equation" position="anchor">
                     <label>(4)</label>
					<graphic id="equation04" xlink:href="jemr-10-04-f-equation-04.png"/>
				</fig>

        <p>To find a similarity between two scanpaths as a
percentage based on the String-edit distance (
          <xref ref-type="bibr" rid="R75">43</xref>
          ), the
distance (<italic>D</italic>) between two scanpaths was firstly divided by
the length of the longer scanpath (<italic>L</italic>) to calculate a
normalised score to prevent possible inconsistencies that can
be caused by different lengths. The normalised score was
secondly subtracted from one and finally multiplied by
100. The relevant formula is shown in Equation (5). For
example, the similarity between DEFG and DEHG is
calculated as 75% as the distance (<italic>D</italic>) between these
scanpaths is equal to one (the substitution between F and
H), and the length of the longer scanpath (<italic>L</italic>) is equal to 4
(both of them have the same length in this example).</p>

<fig id="eq05" fig-type="equation" position="anchor">
                     <label>(5)</label>
					<graphic id="equation05" xlink:href="jemr-10-04-f-equation-05.png"/>
				</fig>

        <p>As we investigated how many users are needed to
achieve almost the same results as 65 users, our
dependent variable was the number of users and our independent
variable was the similarity to the results of 65 users. Once
we calculated the similarity to the entire group for each
smaller group, the curve estimation feature of SPSS
(
<ext-link ext-link-type="uri" xlink:href="http://www.ibm.com/analytics/us/en/technology/spss/" xlink:show="new">http://www.ibm.com/analytics/us/en/technology/spss/</ext-link>
)
was applied to select the best curve that fits into the mean
of these similarities on the six web pages for the browsing
and searching tasks. Specifically, the curve with the
minimum standard error of the estimate was selected.</p>
      </sec>
    </sec>
	
    <sec id="S5">
      <title>Results</title>
	  
      <p>Figure 10 and Figure 11 show the estimated curves by
using the VIPS (
        <xref ref-type="bibr" rid="R41">9</xref>
        ), BOM (
        <xref ref-type="bibr" rid="R44">12</xref>
        ) and user driven
segmentation approaches on the six web pages for the browsing
and searching tasks with their equations where S is the
similarity to the entire group and i is the number of users
(
        <xref ref-type="bibr" rid="R38">6</xref>
        ). The standard errors of the estimated curves for the
VIPS, BOM and user driven approaches are 0.073, 0.066
and 0.034 for the searching tasks, and 0.044, 0.063 and
0.082 for the browsing tasks respectively.</p>

<fig id="fig10" fig-type="figure" position="float">
					<label>Figure 10.</label>
					<caption>
						<p>The estimated curves by using the VIPS, BOM and user driven segmentation approaches on the six web pages for the searching tasks</p>
					</caption>
					<graphic id="graph10" xlink:href="jemr-10-04-f-figure-10.png"/>
				</fig>

<fig id="fig11" fig-type="figure" position="float">
					<label>Figure 11.</label>
					<caption>
						<p>The estimated curves by using the VIPS, BOM and user driven segmentation approaches on the six web pages for the browsing tasks</p>
					</caption>
					<graphic id="graph11" xlink:href="jemr-10-04-f-figure-11.png"/>
				</fig>					

      <p>To compare two estimated curves, the absolute
difference in the curves was calculated for each group size as
illustrated in Table 5. For instance, the difference between
the curves generated by using the VIPS approach and the
BOM approach for three users is equal to 5.87 (With the
BOM: 19.97, With the VIPS: 25.84) for the browsing
tasks and 4.88 (With the BOM: 23.73, With the VIPS:
28.61) for the searching tasks.</p>

<table-wrap id="t05" position="float">
					<label>Table 5.</label>
					<caption>
						<p>The differences between the curves generated by using the VIPS approach and the BOM approach where i is the number of users.</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Task</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>i</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>With the BOM</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>With the VIPS</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Difference</bold>
            </td>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Browsing</td>
            <td rowspan="1" colspan="1">1</td>
            <td rowspan="1" colspan="1">11.91</td>
            <td rowspan="1" colspan="1">15.94</td>
            <td rowspan="1" colspan="1">4.02</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">2</td>
            <td rowspan="1" colspan="1">16.50</td>
            <td rowspan="1" colspan="1">21.62</td>
            <td rowspan="1" colspan="1">5.12</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">3</td>
            <td rowspan="1" colspan="1">19.97</td>
            <td rowspan="1" colspan="1">25.84</td>
            <td rowspan="1" colspan="1">5.87</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">4</td>
            <td rowspan="1" colspan="1">22.86</td>
            <td rowspan="1" colspan="1">29.33</td>
            <td rowspan="1" colspan="1">6.47</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">5</td>
            <td rowspan="1" colspan="1">25.38</td>
            <td rowspan="1" colspan="1">32.35</td>
            <td rowspan="1" colspan="1">6.97</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Searching</td>
            <td rowspan="1" colspan="1">1</td>
            <td rowspan="1" colspan="1">13.79</td>
            <td rowspan="1" colspan="1">17.67</td>
            <td rowspan="1" colspan="1">3.87</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">2</td>
            <td rowspan="1" colspan="1">19.42</td>
            <td rowspan="1" colspan="1">23.95</td>
            <td rowspan="1" colspan="1">4.52</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">3</td>
            <td rowspan="1" colspan="1">23.73</td>
            <td rowspan="1" colspan="1">28.61</td>
            <td rowspan="1" colspan="1">4.88</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">4</td>
            <td rowspan="1" colspan="1">27.36</td>
            <td rowspan="1" colspan="1">32.47</td>
            <td rowspan="1" colspan="1">5.11</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">5</td>
            <td rowspan="1" colspan="1">30.54</td>
            <td rowspan="1" colspan="1">35.81</td>
            <td rowspan="1" colspan="1">5.26</td>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
            <td rowspan="1" colspan="1">&#x2026;</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>

      <p>Table 6 also illustrates the mean, median, maximum,
minimum and standard deviation (SD) of the differences
between the curves generated by using the VIPS, BOM
and user driven approaches. As illustrated in the table, the
mean difference between the curves generated by using
the VIPS approach and the BOM approach is only 4.13%
(SD: 1.13%) for the searching tasks and 11.90% (SD:
2.76%) for the browsing tasks. Furthermore, the mean
difference between the curves generated by using the
VIPS approach and the user driven approach is only
5.88% (SD: 1.47%) for the searching tasks and 11.49%
(SD: 1.62%) for the browsing tasks. Finally, the mean
difference between the curves generated by using the
BOM approach and the user driven approach is only
2.50% (SD: 1.68%) for the searching tasks and 3.23%
(SD: 1.83%) for the browsing tasks.</p>

<table-wrap id="t06" position="float">
					<label>Table 6.</label>
					<caption>
						<p>The mean (M), median (MD), maximum, minimum and standard deviations (SD) of the differences between the curves generated by using the VIPS approach, the BOM approach and the user driven approach where i is the number of users</p>
					</caption>
					<table frame="hsides" rules="groups" cellpadding="3">
						<thead>
          <tr>
            <td rowspan="1" colspan="1">
              <bold>Task</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Approaches</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>M</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>MD</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Max</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>Min</bold>
            </td>
            <td rowspan="1" colspan="1">
              <bold>SD</bold>
            </td>
            <td rowspan="1" colspan="1"/>
          </tr>
						</thead>
						<tbody>
          <tr>
            <td rowspan="1" colspan="1">Browse</td>
            <td rowspan="1" colspan="1">VIPS-BOM</td>
            <td rowspan="1" colspan="1">11.90</td>
            <td rowspan="1" colspan="1">12.60</td>
            <td rowspan="1" colspan="1">15.27</td>
            <td rowspan="1" colspan="1">4.02</td>
            <td rowspan="1" colspan="1">2.76</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">VIPS-User driven</td>
            <td rowspan="1" colspan="1">11.49</td>
            <td rowspan="1" colspan="1">11.85</td>
            <td rowspan="1" colspan="1">13.37</td>
            <td rowspan="1" colspan="1">7.83</td>
            <td rowspan="1" colspan="1">1.62</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM-User driven</td>
            <td rowspan="1" colspan="1">3.23</td>
            <td rowspan="1" colspan="1">3.39</td>
            <td rowspan="1" colspan="1">6.94</td>
            <td rowspan="1" colspan="1">0.00</td>
            <td rowspan="1" colspan="1">1.83</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1">Search</td>
            <td rowspan="1" colspan="1">VIPS-BOM</td>
            <td rowspan="1" colspan="1">4.13</td>
            <td rowspan="1" colspan="1">4.33</td>
            <td rowspan="1" colspan="1">5.53</td>
            <td rowspan="1" colspan="1">1.96</td>
            <td rowspan="1" colspan="1">1.13</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">VIPS-User driven</td>
            <td rowspan="1" colspan="1">5.88</td>
            <td rowspan="1" colspan="1">6.22</td>
            <td rowspan="1" colspan="1">7.75</td>
            <td rowspan="1" colspan="1">1.85</td>
            <td rowspan="1" colspan="1">1.47</td>
            <td rowspan="1" colspan="1"/>
          </tr>
          <tr>
            <td rowspan="1" colspan="1"/>
            <td rowspan="1" colspan="1">BOM-User driven</td>
            <td rowspan="1" colspan="1">2.50</td>
            <td rowspan="1" colspan="1">2.08</td>
            <td rowspan="1" colspan="1">5.79</td>
            <td rowspan="1" colspan="1">0.04</td>
            <td rowspan="1" colspan="1">1.68</td>
          </tr>
						</tbody>
					</table>
					</table-wrap>

      <p>To sum up, the previous and current results are mainly
consistent with ~5\% difference for the searching tasks
and ~10\% difference for the browsing tasks.</p>
    </sec>
	
    <sec id="S6">
      <title>Discussion</title>
	  
      <p>Based on the results presented in this paper, we can
suggest that we can see almost the same trend in the
curves when we use the VIPS (
        <xref ref-type="bibr" rid="R41">9</xref>
        ), BOM (
        <xref ref-type="bibr" rid="R44">12</xref>
        ) and user
driven approaches to segment the web pages. In
particular, the results obtained from the BOM and the user
driven approaches are very close to each other. Therefore, we
can suggest that the previous and current results are
consistent with some small deviations (~5\% difference for
the searching tasks and ~10\% difference for the browsing
tasks).</p>

      <p>Although the VIPS approach is popular among
researchers to segment web pages, they can also use other
page segmentation approaches. However, the previous
study did not show whether the possibility of achieving
almost the same results with fewer users is limited to the
VIPS approach. Therefore, when researchers want to use
another segmentation approach with the STA algorithm,
they could not ensure whether the possibility is still valid
for them. Hence, this study is beneficial for them from
this aspect as it shows that we can approximate almost the
same results with a smaller group of users in scanpath
analysis regardless of the segmentation approach used.</p>

      <p>As mentioned in our related work section, although
other aspects of eye tracking data (especially, heat maps
(
        <xref ref-type="bibr" rid="R57">25</xref>
		)) have been studied, the number of users required for
scanpath analysis has not been studied in depth. The
possibility of achieving almost the same results with less
users in scanpath trend analysis has been raised but the
segmentation effect on this possibility was not
investigated. Therefore, this study is an important step forward in
existing research. Furthermore, the results are still
consistent with the suggestions for heat maps (
        <xref ref-type="bibr" rid="R57">25</xref>), especially
for the browsing tasks, even though the analysis of heat
maps are much easier in comparison with scanpath
analysis (see Figure 2 and Figure 4).</p>

      <p>We also observed some differences in the curves
generated by using the VIPS, BOM and user driven
segmentation approaches. Specifically, the VIPS approach has
higher similarities compared to other two segmentation
approaches, especially for the browsing tasks. The
differences in the curves generated for the browsing tasks are
higher in comparison with the differences in the curves
generated for the searching tasks. Since the participants
did not require to locate specific items on the web pages
for the browsing tasks and they were allowed to browse
freely, the variations between the user scanpaths tend to
be higher, and therefore it is quite normal to see higher
difference in the curves generated for the browsing tasks.
We think that the differences in both the searching and
browsing tasks might be caused by the sizes of the AOIs.
In particular, when the BOM approach is used to
automatically segment the web pages, fewer and larger AOIs are
usually generated. Therefore, user scanpaths become
shorter, as a consequence their resulting paths also
become shorter. In these cases, small deviations between
two scanpaths considerably decrease the similarity
between the scanpaths. For example, the similarity between
the scanpaths ABC and ACB is calculated as 33.33% by
the standard String-edit algorithm, even though they have
the same elements (
        <xref ref-type="bibr" rid="R73">41</xref>
        ). To investigate this expectation,
we should conduct further experiments as we cannot say
that the VIPS approach creates smaller blocks in all the
cases in comparison with other approaches. In the further
experiments, the same methodology can be applied by
segmenting web pages with different granularity levels
and the effects of the AOI sizes can be investigated. The
reason for the VIPS approach having higher similarities
could also be that it might provide the most similar AOIs
to the AOIs which were actually used by the participants.
However, this possible reason should be further
investigated by determining the similarities between the AOIs
generated by the segmentation approaches and the AOIs
which were actually used by the users. However, in order
to do this, a systematic approach should firstly be
developed to discover the borders of the AOIs which were
actually used. These borders can then be used for analysis</p>

      <p>A study conducted by Akp&#x131;nar and Ye&#x15F;ilada (
        <xref ref-type="bibr" rid="R41">9</xref>
        )
suggests that more and smaller segments are preferred by
users instead of fewer and larger segments for the VIPS
approach. Therefore, in the previous study (
        <xref ref-type="bibr" rid="R38">6</xref>
        ), the
experiments was conducted by using the suggested granularity
level of the VIPS algorithm. However, in our current
experiments, we had to use the default granularity level of
the BOM approach as no specific suggestion is provided.
Since the granularity level affects the segmentation, it
may also affect the results. Therefore, when a
segmentation approach is proposed, its most
successful/preferable/satisfactory granularity level should also be
provided, if appropriate.</p>

      <p>Because of the limited number of publicly available
automatic segmentation approaches (
        <xref ref-type="bibr" rid="R39">7</xref>
        ), we could only
use the BOM approach for automatically segment the web
pages to cross-check the previous findings obtained by
using the VIPS approach. Since web pages can also be
manually segmented by researchers based on their goals
(
        <xref ref-type="bibr" rid="R45">13</xref>
        ), we also applied a user driven approach to manually
segment the web pages. However, further experiments
need to be conducted by using more segmentation
approaches and different eye tracking datasets with the same
methodology to increase the generalizability of the
probability of accessing almost the same results with fewer
users. For example, it would be worthwhile to investigate
how this possibility is affected when all users are familiar
with web pages or when web pages are segmented into
very small or very large elements. Furthermore, a number
of eye tracking experts could also be invited to discuss
and identify the AOIs of the web pages based on their
experience and then the same methodology could be
applied. However, manual segmentation is typically based
the goals of the study and it is almost impossible to
estimate all possible segmentations. We could also apply the
grid segmentation. However, the elements of the web
pages could be inappropriately divided in that case as the
web pages were not necessarily designed with a grid
layout. For example, Figure 12 shows how the Babylon
page could be segmented with a 4x4 grid segmentation.
As illustrated, some elements are inappropriately divided,
such as the text under the &#x201C;Full Text Translation&#x201D; title.
Therefore, we did not use the grid segmentation in our
study.</p>

<fig id="fig12" fig-type="figure" position="float">
					<label>Figure 12.</label>
					<caption>
						<p>The Babylon page with a 4-4 grid segmentation.</p>
					</caption>
					<graphic id="graph12" xlink:href="jemr-10-04-f-figure-12.png"/>
				</fig>

      <p>In this study, only the STA algorithm was used but the
methodology can also be applied to other scanpath
analysis algorithms to investigate the effects of the number of
users on their results. For example, the SPAM (Sequential
Pattern Mining) algorithm was used by Hejmady and
Narayanan (
        <xref ref-type="bibr" rid="R76">44</xref>
        ) to discover visual attention patterns
during program debugging with an IDE (Integrated
development environment). They conducted their study with 19
participants but they did not know how their results could
be affected if they had less or more participants. As it is
more reliable to report stable results, it is worthwhile to
investigate the effects on the number of users on other
existing scanpath analysis algorithms.</p>

      <p>This study in not without limitations. The home page
of six web pages were used. Although these web pages
had different levels of visual complexities, more web
pages with more complicated and different designs would
be better to increase the generalizability of our findings.
In this paper, we provided the results of the six web pages
together as illustrated in Figures 10 and 11. However, the
visual complexity and text density may affect the results.
If we had more web pages in each visual complexity
group (low, medium and high), we would be able to
investigate whether the visual complexity of web pages
affects the possibility of achieving almost the same results
with a less number of users. We currently have two web
pages in each visual complexity group. Thus, if we try to
investigate the effects of the visual complexity, we will
have only two pages in each visual complexity group and
therefore the results would not be representative.
Likewise, if we try to investigate the effects of the text
density, we will have only one page with lower density (Apple:
86 words) and two pages with higher density (Yahoo: 353
word, BBC: 300 words), and therefore the results would
not be conclusive. However, it would be interesting to
investigate the effects of the visual complexity and text
density on the possibility of achieving almost the same
results with a less number of users. We are planning to
explore these possible effects in the future studies. One
can design a study to take into consideration both visual
complexity and text density as features in selecting web
pages</p>

      <p>Most of the participants were university students and
daily web users. Even though there were many
participants with different backgrounds, the increase in the
diversity of the users would be better to draw stronger
conclusions. Furthermore, we only used the Levenshtein
Distance algorithm with a substitution cost matrix to
compute similarities between scanpaths. Albeit the
Levenshtein Distance algorithm has widely been used in eye
tracking research, other similarity measures, such as
Needleman and Wunsch (
        <xref ref-type="bibr" rid="R77">45</xref>
        ), can also be used in the
future and the findings can be cross-checked.</p>

      <p>To sum up, this work is a step forward for better
understanding of how the number of users affect scanpath
analysis. It would benefit to both eye tracking and human
computer interaction (HCI) researchers to estimate the
ideal sample size for their studies by considering their
time and budget when they want to conduct scanpath
trend analysis. The methodology used is also promising
for researchers who developed a scanpath analysis
algorithm and want to investigate the effects of the number of
users on the results of their algorithms.</p>
    </sec>
	
    <sec id="S7">
      <title>Conclusion</title>
	  
      <p>In this paper, we show that we can approximate
almost the same results with a smaller group of users in
scanpath trend analysis regardless of the segmentation
approach used. The current findings concur with the
findings of Eraslan, Yesilada (
        <xref ref-type="bibr" rid="R38">6</xref>
        ) suggesting that it is possible
to achieve almost the same results with fewer users.
Specifically, the findings of Eraslan, Yesilada (
        <xref ref-type="bibr" rid="R38">6</xref>
        ) suggest that
it is possible to achieve 75% similarity to the results of 65
users with 27 users for searching tasks and 34 users for
browsing tasks. In this paper, we investigated both the
automatic and manual segmentation effects on the
findings. Based on our experiments, we can suggest that our
current findings are mainly consistent with the previous
findings.</p>

      <p>Although it might not be possible to provide exactly
the same results with fewer users, 75% similarity is a
promising value for practitioners because it means that
approximately 3/4 of the results are the same. For
example, the similarity between the scanpaths
ABCDBADBDEFE and ABCEBADCDEF is calculated
as 75% with the standard String-edit algorithm (
        <xref ref-type="bibr" rid="R73">41</xref>
        ). The
practical usefulness of 75% and other similarities will be
investigated in the future. This paper allows practitioners
to know more about possible differences when they have
lower sample size, thus it helps them to determine their
sample size based on their time and the budget of their
studies.</p>

      <sec id="S7a" sec-type="COI-statement">
        <title>Ethics and Conflict of Interest</title>
		
        <p>The authors declare that the contents of the article are
in agreement with the ethics described in
<ext-link ext-link-type="uri" xlink:href="http://biblio.unibe.ch/portale/elibrary/BOP/jemr/ethics.html" xlink:show="new">http://biblio.unibe.ch/portale/elibrary/BOP/jemr/ethics.html</ext-link> 
and that there is no conflict of interest regarding the
publication of this paper.</p>
      </sec>
	  
      <sec id="S7b">
        <title>Acknowledgements</title>
		
        <p>The dataset used is from our eye tracking study that
was approved by the Ethics Committee of School of
Computer Science at the University of Manchester
(approval ID: CS90). We would like to thank all our
participants for their time and effort.</p>
      </sec>
    </sec>

	

  </body>
  
  <back>
  
  <app-group>
	<app>	
      <title>Appendix</title>
	  
	    
		<sec><title>The Web Pages Used in Our Eye Tracking Study</title>
	
<fig id="fig13" fig-type="figure" position="float">
					<label>Figure 13.</label>
					<caption>
						<p>The Apple web page</p>
					</caption>
					<graphic id="graph13" xlink:href="jemr-10-04-f-figure-13.png"/>
				</fig>
				
<fig id="fig14" fig-type="figure" position="float">
					<label>Figure 14.</label>
					<caption>
						<p>The Babylon web page</p>
					</caption>
					<graphic id="graph14" xlink:href="jemr-10-04-f-figure-14.png"/>
				</fig>	

<fig id="fig15" fig-type="figure" position="float">
					<label>Figure 15.</label>
					<caption>
						<p>The AVG web page</p>
					</caption>
					<graphic id="graph15" xlink:href="jemr-10-04-f-figure-15.png"/>
				</fig>

<fig id="fig16" fig-type="figure" position="float">
					<label>Figure 16.</label>
					<caption>
						<p>The Yahoo web page</p>
					</caption>
					<graphic id="graph16" xlink:href="jemr-10-04-f-figure-16.png"/>
				</fig>

<fig id="fig17" fig-type="figure" position="float">
					<label>Figure 17.</label>
					<caption>
						<p>The Godaddy web page</p>
					</caption>
					<graphic id="graph17" xlink:href="jemr-10-04-f-figure-17.png"/>
				</fig>

<fig id="fig18" fig-type="figure" position="float">
					<label>Figure 18.</label>
					<caption>
						<p>The BBC web page</p>
					</caption>
					<graphic id="graph18" xlink:href="jemr-10-04-f-figure-18.png"/>
				</fig>												
	</sec>	
	</app>
</app-group>		
<ref-list>
<ref id="R41"><label>9</label><mixed-citation publication-type="conference" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Akpınar</surname>, <given-names>M. E.</given-names></string-name>, &amp; <string-name><surname>Yeşilada</surname>, <given-names>Y.</given-names></string-name></person-group> (<year>2013</year>). Vision Based Page Segmentation Algorithm: Extended and Perceived Success. In Q. Z. Sheng &amp; J. Kjeldskov (Eds.), Current Trends in Web Engineering: ICWE 2013 International Workshops ComposableWeb, QWE, MDWE, DMSSW, EMotions, CSE, SSN, and PhD Symposium, Aalborg, Denmark, July 8-12, 2013. Revised Selected Papers (pp. 238-252). Cham: Springer International Publishing. https://doi.org/<pub-id pub-id-type="doi">10.1007/978-3-319-04244-2_22</pub-id></mixed-citation></ref>
<ref id="R34"><label>2</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Akpınar</surname>, <given-names>M. E.</given-names></string-name>, &amp; <string-name><surname>Yeşilada</surname>, <given-names>Y.</given-names></string-name></person-group> (<year>2015</year>). "<article-title>Old Habits Die Hard!</article-title>": Eyetracking Based Experiential Transcoding: A Study with Mobile Users. Paper presented at the Proceedings of the 12th Web for All Conference, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2745555.2746646">http://doi.acm.org/10.1145/2745555.2746646</ext-link> <pub-id pub-id-type="doi">10.1145/2745555.2746646</pub-id></mixed-citation></ref>
<ref id="R52"><label>20</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Alroobaea</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Mayhew</surname>, <given-names>P. J.</given-names></string-name></person-group> (<year>2014</year>). <article-title>How many participants are really enough for usability studies?</article-title> Paper presented at the 2014 Science and Information Conference. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1109/SAI.2014.6918171">http://doi.acm.org/10.1109/SAI.2014.6918171</ext-link> <pub-id pub-id-type="doi">10.1109/SAI.2014.6918171</pub-id></mixed-citation></ref>
<ref id="R53"><label>21</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Bevan</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Barnum</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Cockton</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Nielsen</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Spool</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Wixon</surname>, <given-names>D.</given-names></string-name></person-group> (<year>2003</year>). The "Magic Number 5": Is It Enough for Web Testing? Paper presented at the CHI '03 Extended Abstracts on Human Factors in Computing Systems, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/765891.765936">http://doi.acm.org/10.1145/765891.765936</ext-link></mixed-citation></ref>
<ref id="R70"><label>38</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Bing</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Guo</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Lam</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Niu</surname>, <given-names>Z.-Y.</given-names></string-name>, &amp; <string-name><surname>Wang</surname>, <given-names>H.</given-names></string-name></person-group> (<year>2014</year>). <article-title>Web Page Segmentation with Structured Prediction and Its Application in Web Page Classification.</article-title> Paper presented at the Proceedings of the 37th International ACM SIGIR Conference on Research &amp; Development in Information Retrieval, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2600428.2609630">http://doi.acm.org/10.1145/2600428.2609630</ext-link> <pub-id pub-id-type="doi">10.1145/2600428.2609630</pub-id></mixed-citation></ref>
<ref id="R35"><label>3</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Brown</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Jay</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2010</year>). <article-title>Audio Access to Calendars.</article-title> Paper presented at the Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A), New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/1805986.1806028">http://doi.acm.org/10.1145/1805986.1806028</ext-link> <pub-id pub-id-type="doi">10.1145/1805986.1806028</pub-id></mixed-citation></ref>
<ref id="R55"><label>23</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Caulton</surname>, <given-names>D. A.</given-names></string-name></person-group> (<year>2001</year>). <article-title>Relaxing the homogeneity assumption in usability testing.</article-title> <source>Behaviour &amp; Information Technology</source>, <volume>20</volume>(<issue>1</issue>), <fpage>1</fpage>–<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1080/01449290010020648</pub-id><issn>0144-929X</issn></mixed-citation></ref>
<ref id="R54"><label>22</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Cazañas</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>de San Miguel</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Parra</surname>, <given-names>E.</given-names></string-name></person-group> (<year>2017</year>). <article-title>Estimating Sample Size for Usability Testing.</article-title> Enfoqute(8), 172-185. <ext-link ext-link-type="uri" xlink:href="http://ingenieria.ute.edu.ec/enfoqueute/index.php/revista/article/view/126/133">http://ingenieria.ute.edu.ec/enfoqueute/index.php/revista/article/view/126/133</ext-link></mixed-citation></ref>
<ref id="R43"><label>11</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Collberg</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Proebsting</surname>, <given-names>T. A.</given-names></string-name></person-group> (<year>2016</year>). <article-title>Repeatability in Computer Systems Research.</article-title> <source>Communications of the ACM</source>, <volume>59</volume>(<issue>3</issue>), <fpage>62</fpage>–<lpage>69</lpage>. Retrieved from <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2812803">http://doi.acm.org/10.1145/2812803</ext-link> <pub-id pub-id-type="doi">10.1145/2812803</pub-id><issn>0001-0782</issn></mixed-citation></ref>
<ref id="R72"><label>40</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Cuzzolaa</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Jovanović</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Bagheri</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Gašević</surname>, <given-names>D.</given-names></string-name></person-group> (<year>2015</year>). <article-title>Automated Classification and Localization of Daily Deal Content from the Web.</article-title> <source>Applied Soft Computing</source>, <volume>31</volume>(<issue>C</issue>), <fpage>241</fpage>–<lpage>256</lpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2015.02.029</pub-id><issn>1568-4946</issn></mixed-citation></ref>
<ref id="R33"><label>1</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Ehmke</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Wilson</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2007</year>). <article-title>Identifying Web Usability Problems from Eye-tracking Data.</article-title> Paper presented at the Proceedings of the 21st British HCI Group Annual Conference on People and Computers: HCI...But Not As We Know It - Volume 1, Swinton, UK, UK. <ext-link ext-link-type="uri" xlink:href="http://dl.acm.org/citation.cfm?id=1531294.1531311">http://dl.acm.org/citation.cfm?id=1531294.1531311</ext-link></mixed-citation></ref>
<ref id="R58"><label>26</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Eraslan</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2015</year>). <article-title>Eye tracking scanpath analysis techniques on web pages: A survey, evaluation and comparison.</article-title> Journal of Eye Movement Research, 9(1). <ext-link ext-link-type="uri" xlink:href="https://bop.unibe.ch/index.php/JEMR/article/view/243">https://bop.unibe.ch/index.php/JEMR/article/view/243</ext-link> 0</mixed-citation></ref>
<ref id="R38"><label>6</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Eraslan</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2016</year>a). Eye Tracking Scanpath Analysis on Web Pages: How Many Users? Paper presented at the Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research &amp; Applications, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2857491.2857519">http://doi.acm.org/10.1145/2857491.2857519</ext-link></mixed-citation></ref>
<ref id="R40"><label>8</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Eraslan</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2016</year>b). Scanpath Trend Analysis on Web Pages: Clustering Eye Tracking Scanpaths. ACM Trans. Web, 10(4), 20:21- 20:35. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2970818">http://doi.acm.org/10.1145/2970818</ext-link></mixed-citation></ref>
<ref id="R61"><label>29</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Eraslan</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2016</year>c). Trends in Eye Tracking Scanpaths: Segmentation Effect? Paper presented at the Proceedings of the 27th ACM Conference on Hypertext and Social Media, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2914586.2914591">http://doi.acm.org/10.1145/2914586.2914591</ext-link></mixed-citation></ref>
<ref id="R50"><label>18</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Faulkner</surname>, <given-names>L.</given-names></string-name></person-group> (<year>2003</year>). <article-title>Beyond the five-user assumption: Benefits of increased sample sizes in usability testing.</article-title> <source>Behavior Research Methods, Instruments, &amp; Computers</source>, <volume>35</volume>(<issue>3</issue>), <fpage>379</fpage>–<lpage>383</lpage>. <pub-id pub-id-type="doi">10.3758/BF03195514</pub-id><pub-id pub-id-type="pmid">14587545</pub-id><issn>0743-3808</issn></mixed-citation></ref>
<ref id="R71"><label>39</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Feng</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Zhang</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Wu</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Wang</surname>, <given-names>C. J.</given-names></string-name></person-group> (<year>2016</year>). <article-title>Web Page Segmentation and Its Application for Web Information Crawling.</article-title> Paper presented at the 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI). <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1109/ICTAI.2016.0097">http://doi.acm.org/10.1109/ICTAI.2016.0097</ext-link> <pub-id pub-id-type="doi">10.1109/ICTAI.2016.0097</pub-id></mixed-citation></ref>
<ref id="R76"><label>44</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Hejmady</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Narayanan</surname>, <given-names>N. H.</given-names></string-name></person-group> (<year>2012</year>). <article-title>Visual Attention Patterns During Program Debugging with an IDE.</article-title> Paper presented at the Proceedings of the Symposium on Eye Tracking Research and Applications, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2168556.2168592">http://doi.acm.org/10.1145/2168556.2168592</ext-link> <pub-id pub-id-type="doi">10.1145/2168556.2168592</pub-id></mixed-citation></ref>
<ref id="R51"><label>19</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Hwang</surname>, <given-names>W.</given-names></string-name>, &amp; <string-name><surname>Salvendy</surname>, <given-names>G.</given-names></string-name></person-group> (<year>2010</year>). <article-title>Number of People Required for Usability Evaluation: The 10±2 Rule.</article-title> <source>Communications of the ACM</source>, <volume>53</volume>(<issue>5</issue>), <fpage>130</fpage>–<lpage>133</lpage>. Retrieved from <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/1735223.1735255">http://doi.acm.org/10.1145/1735223.1735255</ext-link> <pub-id pub-id-type="doi">10.1145/1735223.1735255</pub-id><issn>0001-0782</issn></mixed-citation></ref>
<ref id="R37"><label>5</label><mixed-citation publication-type="book-chapter" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Jay</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Lunn</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Michailidou</surname>, <given-names>E.</given-names></string-name></person-group> (<year>2008</year>). <chapter-title>End User Evaluations</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>S.</given-names> <surname>Harper</surname></string-name> &amp; <string-name><given-names>Y.</given-names> <surname>Yesilada</surname></string-name> (<role>Eds.</role>),</person-group> <source>Web Accessibility: A Foundation for Research</source> (pp. <fpage>107</fpage>–<lpage>126</lpage>). <publisher-loc>London</publisher-loc>: <publisher-name>Springer London</publisher-name>; <pub-id pub-id-type="doi">10.1007/978-1-84800-050-6_8</pub-id></mixed-citation></ref>
<ref id="R46"><label>14</label><mixed-citation publication-type="book" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Johnson</surname>, <given-names>J.</given-names></string-name></person-group> (<year>2010</year>). <source>Designing with the Mind in Mind: Simple Guide to Understanding User Interface Design Rules</source>. <publisher-loc>San Francisco, CA, USA</publisher-loc>: <publisher-name>Morgan Kaufmann Publishers Inc.</publisher-name></mixed-citation></ref>
<ref id="R73"><label>41</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Josephson</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Holmes</surname>, <given-names>M. E.</given-names></string-name></person-group> (<year>2002</year>). Visual Attention to Repeated Internet Images: Testing the Scanpath Theory on the World Wide Web. Paper presented at the Proceedings of the 2002 Symposium on Eye Tracking Research &amp; Applications, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/507072.507081">http://doi.acm.org/10.1145/507072.507081</ext-link></mixed-citation></ref>
<ref id="R63"><label>31</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Kohlschütter</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Fankhauser</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Nejdl</surname>, <given-names>W.</given-names></string-name></person-group> (<year>2010</year>). <article-title>Boilerplate Detection Using Shallow Text Features.</article-title> Paper presented at the Proceedings of the Third ACM International Conference on Web Search and Data Mining, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/1718487.1718542">http://doi.acm.org/10.1145/1718487.1718542</ext-link> <pub-id pub-id-type="doi">10.1145/1718487.1718542</pub-id></mixed-citation></ref>
<ref id="R69"><label>37</label><mixed-citation publication-type="conference" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Kreuzer</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Hage</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Feelders</surname>, <given-names>A.</given-names></string-name></person-group> (<year>2015</year>). <article-title>A Quantitative Comparison of Semantic Web Page Segmentation Approaches.</article-title> In <person-group person-group-type="editor"><string-name><given-names>P.</given-names> <surname>Cimiano</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Frasincar</surname></string-name>, <string-name><given-names>G.-J.</given-names> <surname>Houben</surname></string-name>, &amp; <string-name><given-names>D.</given-names> <surname>Schwabe</surname></string-name> <role>(Eds.)</role></person-group>, Engineering the Web in the Big Data Era: <source>15th International Conference, ICWE 2015</source>, <conf-loc>Rotterdam, The Netherlands</conf-loc>, <conf-date>June 23-26, 2015</conf-date>, Proceedings (pp. <fpage>374</fpage>-<lpage>391</lpage>). <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>. https://doi.org/<pub-id pub-id-type="doi">10.1007/978-3-319-19890-3_24</pub-id></mixed-citation></ref>
<ref id="R48"><label>16</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Lewis</surname>, <given-names>J. R.</given-names></string-name></person-group> (<year>1982</year>). <article-title>Testing small system customer setup.</article-title> <source>Proceedings of the Human Factors and Ergonomics Society Annual Meeting</source>, <volume>26</volume>(<issue>8</issue>), <fpage>718</fpage>–<lpage>720</lpage>. <pub-id pub-id-type="doi">10.1177/154193128202600810</pub-id><issn>1071-1813</issn></mixed-citation></ref>
<ref id="R59"><label>27</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Michailidou</surname>, <given-names>E.</given-names></string-name></person-group> (<year>2006</year>). ViCRAM: Visual Complexity Rankings and Accessibility Metrics. SIGACCESS Access. Comput.(86), 24-27. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/1196148.1196154">http://doi.acm.org/10.1145/1196148.1196154</ext-link></mixed-citation></ref>
<ref id="R65"><label>33</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Michailidou</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Bechhofer</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2008</year>). <article-title>Visual Complexity and Aesthetic Perception of Web Pages.</article-title> Paper presented at the Proceedings of the 26th Annual ACM International Conference on Design of Communication, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/1456536.1456581">http://doi.acm.org/10.1145/1456536.1456581</ext-link> <pub-id pub-id-type="doi">10.1145/1456536.1456581</pub-id></mixed-citation></ref>
<ref id="R64"><label>32</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Milic-Frayling</surname>, <given-names>N.</given-names></string-name>, &amp; <string-name><surname>Sommerer</surname>, <given-names>R.</given-names></string-name></person-group> (<year>2002</year>). SmartView: Flexible Viewing of Web Page Contents. Paper presented at the Poster Proceedings of the Eleventh International World Wide Web Conference. <ext-link ext-link-type="uri" xlink:href="http://www2002.org/CDROM/poster/172/index.html">http://www2002.org/CDROM/poster/172/index.html</ext-link></mixed-citation></ref>
<ref id="R77"><label>45</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Needleman</surname>, <given-names>S. B.</given-names></string-name>, &amp; <string-name><surname>Wunsch</surname>, <given-names>C. D.</given-names></string-name></person-group> (<year>1970</year>). <article-title>A general method applicable to the search for similarities in the amino acid sequence of two proteins.</article-title> <source>Journal of Molecular Biology</source>, <volume>48</volume>(<issue>3</issue>), <fpage>443</fpage>–<lpage>453</lpage>. <pub-id pub-id-type="doi">10.1016/0022-2836(70)90057-4</pub-id><pub-id pub-id-type="pmid">5420325</pub-id><issn>0022-2836</issn></mixed-citation></ref>
<ref id="R49"><label>17</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Nielsen</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Landauer</surname>, <given-names>T. K.</given-names></string-name></person-group> (<year>1993</year>). <article-title>A Mathematical Model of the Finding of Usability Problems.</article-title> Paper presented at the Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/169059.169166">http://doi.acm.org/10.1145/169059.169166</ext-link> <pub-id pub-id-type="doi">10.1145/169059.169166</pub-id></mixed-citation></ref>
<ref id="R45"><label>13</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Owens</surname>, <given-names>J. W.</given-names></string-name>, <string-name><surname>Chaparro</surname>, <given-names>B. S.</given-names></string-name>, &amp; <string-name><surname>Palmer</surname>, <given-names>E. M.</given-names></string-name></person-group> (<year>2011</year>). Text Advertising Blindness: The New Banner Blindness? J. Usability Studies, 6(3), 12:172-112:197. <ext-link ext-link-type="uri" xlink:href="http://dl.acm.org/citation.cfm?id=2007456.2007460">http://dl.acm.org/citation.cfm?id=2007456.2007460</ext-link></mixed-citation></ref>
<ref id="R57"><label>25</label><mixed-citation publication-type="web-page" specific-use="parsed"><person-group person-group-type="author"><string-name><surname>Pernice</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name><surname>Nielsen</surname>, <given-names>J.</given-names></string-name></person-group> (<year>2009</year>). <article-title>How to Conduct Eyetracking Studies.</article-title> Retrieved from <ext-link ext-link-type="uri" xlink:href="https://media.nngroup.com/media/reports/free/How_to_Conduct_Eyetracking_Studies.pdf">https://media.nngroup.com/media/reports/free/How_to_Conduct_Eyetracking_Studies.pdf</ext-link></mixed-citation></ref>
<ref id="R62"><label>30</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Sanoja</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Gançarski</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2013</year>). <article-title>Block-o-Matic: a Web Page Segmentation Tool and its Evaluation.</article-title> In: 29`eme journ’ees "Base de donn’ees avanc’ees", BDA’13. <ext-link ext-link-type="uri" xlink:href="https://hal.archives-ouvertes.fr/hal-">https://hal.archives-ouvertes.fr/hal-</ext-link> 00881693</mixed-citation></ref>
<ref id="R44"><label>12</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Sanoja</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Gançarski</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2014</year>). <article-title>Block-o-Matic: A web page segmentation framework.</article-title> Paper presented at the 2014 International Conference on Multimedia Computing and Systems (ICMCS). <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1109/ICMCS.2014.6911249">http://doi.acm.org/10.1109/ICMCS.2014.6911249</ext-link> <pub-id pub-id-type="doi">10.1109/ICMCS.2014.6911249</pub-id></mixed-citation></ref>
<ref id="R47"><label>15</label><mixed-citation publication-type="journal" specific-use="restruct"><person-group person-group-type="author"><string-name><surname>Schmettow</surname>, <given-names>M.</given-names></string-name></person-group> (<year>2012</year>). <article-title>Sample Size in Usability Studies.</article-title> <source>Communications of the ACM</source>, <volume>55</volume>(<issue>4</issue>), <fpage>64</fpage>–<lpage>70</lpage>. Retrieved from <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2133806.2133824">http://doi.acm.org/10.1145/2133806.2133824</ext-link> <pub-id pub-id-type="doi">10.1145/2133806.2133824</pub-id><issn>0001-0782</issn></mixed-citation></ref>
<ref id="R56"><label>24</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Spool</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Schroeder</surname>, <given-names>W.</given-names></string-name></person-group> (<year>2001</year>). Testing Web Sites: Five Users is Nowhere Near Enough. Paper presented at the CHI '01 Extended Abstracts on Human Factors in Computing Systems, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/634067.634236">http://doi.acm.org/10.1145/634067.634236</ext-link></mixed-citation></ref>
<ref id="R60"><label>28</label><mixed-citation publication-type="book" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Spool</surname>, <given-names>J. M.</given-names></string-name>, <string-name><surname>Scanlon</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Snyder</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Carolyn</surname>, <given-names>W.</given-names></string-name>, &amp; <string-name><surname>DeAngelo</surname>, <given-names>T.</given-names></string-name></person-group> (<year>1999</year>). Web Site Usability: A Designer's Guide: Morgan Kaufmann Publishers Inc. San Francisco, CA, USA. 1-55860-569-X</mixed-citation></ref>
<ref id="R74"><label>42</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Takeuchi</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Habuchi</surname>, <given-names>Y.</given-names></string-name></person-group> (<year>2007</year>). <article-title>A Quantitative Method for Analyzing Scan Path Data Obtained by Eye Tracker.</article-title> Paper presented at the 2007 IEEE Symposium on Computational Intelligence and Data Mining. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1109/CIDM.2007.368885">http://doi.acm.org/10.1109/CIDM.2007.368885</ext-link> <pub-id pub-id-type="doi">10.1109/CIDM.2007.368885</pub-id></mixed-citation></ref>
<ref id="R42"><label>10</label><mixed-citation publication-type="book" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Tobii Technology</surname> <given-names>AB</given-names></string-name></person-group>. (<year>2010</year>). Tobii StudioTM 2.X User Manual (Sep. 2010): Tobii Technology AB.</mixed-citation></ref>
<ref id="R68"><label>36</label><mixed-citation publication-type="conference" specific-use="linked"> <person-group person-group-type="author"><string-name><surname>Toh</surname>, <given-names>H. J.</given-names></string-name>, &amp; <string-name><surname>Hong</surname>, <given-names>J. L.</given-names></string-name></person-group> (<year>2014</year>). <article-title>Webpage Segmentation Using Ontology and Word Matching.</article-title> In <person-group person-group-type="editor"><string-name><given-names>C. K.</given-names> <surname>Loo</surname></string-name>, <string-name><given-names>K. S.</given-names> <surname>Yap</surname></string-name>, <string-name><given-names>K. W.</given-names> <surname>Wong</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Teoh</surname></string-name>, &amp; <string-name><given-names>K.</given-names> <surname>Huang</surname></string-name> <role>(Eds.)</role></person-group>, Neural Information Processing: <source>21st International Conference, ICONIP 2014</source>, <conf-loc>Kuching, Malaysia</conf-loc>, <conf-date>November 3-6, 2014</conf-date>. Proceedings, Part II (pp. <fpage>454</fpage>- <lpage>461</lpage>). <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>. https://doi.org/<pub-id pub-id-type="doi">10.1007/978-3-319-12640-1_55</pub-id></mixed-citation></ref>
<ref id="R75"><label>43</label><mixed-citation publication-type="conference" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Underwood</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Humphrey</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name><surname>Foulsham</surname>, <given-names>T.</given-names></string-name></person-group> (<year>2008</year>). <article-title>Knowledge-Based Patterns of Remembering: Eye Movement Scanpaths Reflect Domain Experience.</article-title> In <source>Proceedings of the 4th Symposium of the Workgroup Human-Computer Interaction and Usability Engineering of the Austrian Computer Society on HCI and Usability for Education and Work</source> (pp. <fpage>125</fpage>-<lpage>144</lpage>). <publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer-Verlag</publisher-name>. http://dx.doi.org/<pub-id pub-id-type="doi">10.1007/978-3-540-89350-9_10</pub-id></mixed-citation></ref>
<ref id="R66"><label>34</label><mixed-citation publication-type="conference" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Wei</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Lu</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>X.</given-names></string-name>, &amp; <string-name><surname>Liu</surname>, <given-names>J.</given-names></string-name></person-group> (<year>2015</year>). <article-title>Web page segmentation based on the hough transform and vision cues.</article-title> Paper presented at the <source>2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)</source>. https://doi.org/<pub-id pub-id-type="doi">10.1109/APSIPA.2015.7415394</pub-id></mixed-citation></ref>
<ref id="R39"><label>7</label><mixed-citation publication-type="web-page" specific-use="parsed"><person-group person-group-type="author"><string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name></person-group> (<year>2011</year>). <article-title>Web Page Segmentation: A Review.</article-title> Retrieved from <ext-link ext-link-type="uri" xlink:href="http://emine.ncc.metu.edu.tr/deliverables/emine_D0.pdf">http://emine.ncc.metu.edu.tr/deliverables/emine_D0.pdf</ext-link></mixed-citation></ref>
<ref id="R36"><label>4</label><mixed-citation publication-type="web-page" specific-use="unparsed"><person-group person-group-type="author"><string-name><surname>Yesilada</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Harper</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Eraslan</surname>, <given-names>S.</given-names></string-name></person-group> (<year>2013</year>). Experiential Transcoding: An EyeTracking Approach. Paper presented at the Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility, New York, NY, USA. <ext-link ext-link-type="uri" xlink:href="http://doi.acm.org/10.1145/2461121.2461134">http://doi.acm.org/10.1145/2461121.2461134</ext-link></mixed-citation></ref>
<ref id="R67"><label>35</label><mixed-citation publication-type="web-page" specific-use="linked"><person-group person-group-type="author"><string-name><surname>Zeleny</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Burget</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Zendulka</surname>, <given-names>J.</given-names></string-name></person-group> (<year>2017</year>). <article-title>Box clustering segmentation: A new method for visionbased web page preprocessing.</article-title> Information Processing &amp; Management, 53(3), 735 - 750. <ext-link ext-link-type="uri" xlink:href="http://www.sciencedirect.com/science/article/pii/S030">http://www.sciencedirect.com/science/article/pii/S030</ext-link> 6457316301169 <pub-id pub-id-type="doi">10.1016/j.ipm.2017.02.002</pub-id></mixed-citation></ref>
	
</ref-list>
  </back>  
</article>


