<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd" [
<!ENTITY RFC2119    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.2119.xml'>
<!ENTITY RFC2978    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.2978.xml'>
<!ENTITY RFC3501    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.3501.xml'>
<!ENTITY RFC4314    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.4314.xml'>
<!ENTITY RFC4466    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.4466.xml'>
<!ENTITY RFC4731    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.4731.xml'>
<!ENTITY RFC5182    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.5182.xml'>
<!ENTITY RFC5234    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.5234.xml'>
<!ENTITY RFC5267    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.5267.xml'>
<!ENTITY RFC5465    PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.5465.xml'>
]>
<?xml-stylesheet type='text/xsl' href='http://xml.resource.org/authoring/rfc2629.xslt' ?>
<?rfc toc="yes" ?>
<?rfc tocompact="no" ?>
<?rfc tocindent="no" ?>
<?rfc comments="yes" ?>
<?rfc inline="yes" ?>
<?rfc symrefs="yes" ?>
<?rfc sortrefs="yes"?>
<?rfc iprnotified="no" ?>
<?rfc strict="yes" ?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>

<rfc category="exp"
     updates="4466"
     ipr="trust200902"
     docName="draft-ietf-morg-multimailbox-search-07">
  <front>
    <title>IMAP4 Multimailbox SEARCH Extension</title>
    
    <author initials='B.' surname="Leiba" fullname='Barry Leiba'>
      <organization>Huawei Technologies</organization>
      <address>
        <phone>+1 646 827 0648</phone>
        <email>barryleiba@computer.org</email>
        <uri>http://internetmessagingtechnology.org/</uri>
      </address>
    </author>
    <author initials='A.' surname="Melnikov" fullname='Alexey Melnikov'>
      <organization>Isode Limited</organization>
      <address>
        <postal>
          <street>5 Castle Business Village</street>
          <street>36 Station Road</street>
          <city>Hampton</city>
          <region>Middlesex</region>
          <code>TW12 2BX</code>
          <country>UK</country>
        </postal>
        <email>Alexey.Melnikov@isode.com</email>
        <uri>http://www.melnikov.ca/</uri>
      </address>
    </author>
      
    <date year="2011"/>
    <area>Applications</area>
    <workgroup>Message ORGanization Working Group</workgroup>
    <keyword>IMAP</keyword>
    <keyword>email</keyword>
    <keyword>search</keyword>
    <keyword>multiple mailboxes</keyword>
    <keyword>imapext</keyword>
    <abstract>
      <t>
        The IMAP4 specification allows the searching only of the selected
        mailbox.  A user often wants to search multiple mailboxes, and a
        client that wishes to support this must issue a series of SELECT and
        SEARCH commands, waiting for each to complete before moving on to the
        next.  This extension allows a client to search multiple mailboxes
        with one command, limiting the round-trips and waiting for various
        searches to complete, and not requiring disruption of the currently
        selected mailbox.  This also uses MAILBOX and TAG fields in
        ESEARCH responses, allowing a client to pipeline the searches if it chooses.
      </t>
    </abstract>
  </front>

  <middle>
    <section title="Introduction">
      <t>
        The IMAP4 specification allows the searching only of the selected
        mailbox.  A user often wants to search multiple mailboxes, and a
        client that wishes to support this must issue a series of SELECT and
        SEARCH commands, waiting for each to complete before moving on to the
        next.  The commands can't be pipelined, because the server might run
        them in parallel, and the untagged SEARCH responses could not then
        be distinguished from each other.
      </t>

      <t>
        This extension allows a client to search multiple mailboxes
        with one command, and includes MAILBOX and TAG fields in the ESEARCH response,
        yielding the following advantages:
        <list style="symbols">
          <t>
            A single command limits the number of round-trips needed to search
            a set of mailboxes.
          </t>
          <t>
            A single command eliminates the need to wait for one search to complete
            before starting the next.
          </t>
          <t>
            A single command allows the server to optimize the search, if it can.
          </t>
          <t>
            A command that is not dependent upon the selected mailbox eliminates the
            need to disrupt the selection state, or to open another IMAP connection.
          </t>
          <t>
            The MAILBOX and TAG fields in the responses allow a client to distinguish
            which responses go with which search (and which mailbox).
            A client can safely pipeline these search commands without danger of confusion.
          </t>
        </list>
      </t>

      <section title="Conventions used in this document">
        <t>
          In examples, "C:" indicates lines sent by a client that is connected
          to a server.  "S:" indicates lines sent by the server to the client.
        </t>
        <t>
          The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
          "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document
          are to be interpreted as described in RFC 2119 <xref target="RFC2119"/>.
        </t>
      </section>
        
    </section>

    <section title="New ESEARCH command">
      <t>
        <list style="hanging" hangIndent="12">
        <t hangText="Arguments:">OPTIONAL source options<vspace/>
          OPTIONAL result options<vspace/>
          OPTIONAL charset specification (see <xref target="RFC2978"/>)<vspace/>
          searching criteria (one or more)
        </t>
        <t hangText="Responses:">REQUIRED untagged response: ESEARCH<vspace/>
        </t>
        <t hangText="Result:">OK - search completed<vspace/>
          NO - error: cannot search that charset or criteria<vspace/>
          BAD - command unknown or arguments invalid
        </t>
        </list>
      </t>

      <t>
        This section defines a new ESEARCH command, which works similarly to
        the UID SEARCH command described in section 2.6.1 of <xref target="RFC4466"/>
        (initially described in section 6.4.4 of <xref target="RFC3501"/>
        and extended by <xref target="RFC4731"/>).
      </t>

      <t>
        The ESEARCH command further extends searching by allowing for
        optional source and result options.
        This document does not define any new result options
        (see Section 3.1 of <xref target="RFC4731"/>).
        A server that supports this extension includes "MULTISEARCH"
        in its IMAP capability string.        
      </t>

      <t>
        Because there has been confusion about this, it is worth pointing out
        that with ESEARCH, as with *any* SEARCH or UID SEARCH command, it MUST NOT
        be considered an error if the search terms include a range of message
        numbers that extends (or, in fact, starts) beyond the end of the mailbox.
        For example, a client might want to establish a rolling window through
        the search results this way:
      </t>

      <t>
        C: tag1 UID ESEARCH FROM "frobozz" 1:100
      </t>

      <t>
        ...followed later by this:
      </t>

      <t>
        C: tag1 UID ESEARCH FROM "frobozz" 101:200
      </t>

      <t>
        ...and so on.  This tells the server to match only the first hundred messages
        in the mailbox the first time, the second hundred the second time, etc.
        In fact, it might likely allow the server to optimize the search significantly.
        In the above example, whether the mailbox contains 50 or 150 or 250 messages,
        neither of the search commands shown will result in an error.  It is up to
        the client to know when to stop moving its search window.
      </t>

      <t>
      </t>

      <section title="The ESEARCH response">
        <t>
          In response to an ESEARCH command,
          the server MUST return ESEARCH responses <xref target="RFC4731"/>
          (that is, not SEARCH responses).
          Because message numbers are not useful for mailboxes that are not selected,          
          the responses MUST contain information about UIDs, not message numbers.
          This is true even if the source options specify that only the selected mailbox
          be searched.
        </t>

        <t>
          Presence of a source option in absence of a result option implies the "ALL" result
          option (see Section 3.1 of <xref target="RFC4731"/>).  Note that this is not the same
          as the result from the SEARCH command described in the IMAP base protocol.
        </t>

        <t>
          Source options describe which mailboxes must be searched for messages.
          An ESEARCH command with source options does not affect which
          mailbox, if any, is currently selected, regardless of which mailboxes are searched.
        </t>

        <t>
          For each mailbox satisfying the source options, a single
          ESEARCH response MUST be returned
          if any messages in that mailbox match the search criteria.
          An ESEARCH response MUST NOT
          be returned for mailboxes that contain no matching messages.
          This is true even when result options such as MIN, MAX, and COUNT are specified
          (see section 3.1 of <xref target="RFC4731"/>), and the values returned
          (lowest UID matched, highest UID matched, and number of messages matched,
          respectively) apply to the mailbox reported in that ESEARCH response.
        </t>

        <t>
          Note that it is possible for an ESEARCH command to return *no* untagged responses
          (no ESEARCH responses at all), in the case that there are no matches to the search
          in any of the mailboxes that satisfy the source options.
          Clients can detect this situation by finding the tagged OK response without having
          received any matching untagged ESEARCH responses.
        </t>

        <t>
          Each ESEARCH response MUST contain the MAILBOX, TAG, and UIDVALIDITY correlators.
          Correlators allow clients to issue several ESEARCH commands at once (pipelined).
          If the SEARCHRES <xref target="RFC5182"/> extension is used in an ESEARCH
          command, that ESEARCH command MUST be executed by the server after all previous
          SEARCH/ESEARCH commands have completed, and before any subsequent SEARCH/ESEARCH
          commands are executed.
          The server MAY perform consecutive ESEARCH commands in parallel
          as long as none of them use the SEARCHRES extension.
        </t>
      </section>

      <section anchor="source-options" title="Source options: specifying mailboxes to search">
        <t>
          The source options, if present, MUST contain a mailbox specifier
          as defined in the IMAP NOTIFY extension <xref target="RFC5465"/>,
          section 6 (using the "filter-mailboxes" ABNF item), with the
          following differences:
          <list style="numbers">
            <t>
              The "selected-delayed" specifier is not valid here.
            </t>
            <t>
              A "subtree-one" specifier is added.  The "subtree" specifier
              results in a search of the specified mailbox and all selectable
              mailboxes that are subordinate to it, through an indefinitely
              deep hierarchy.
              The "subtree-one" specifier results in a search of
              the specified mailbox and all selectable child mailboxes,
              one hierarchy level down.
            </t>
          </list>
        </t>

        <t>
          If "subtree" is specified, the server MUST defend against loops
          in the hierarchy (for example, those caused by recursive file-system
          links within the message store).
          The server SHOULD do this by keeping track of the mailboxes that have
          been searched, and terminating the hierarchy traversal when a repeat
          is found.
          If it cannot do that, it MAY do it by limiting the hierarchy depth.
        </t>

        <t>
          If the source options are not present, the value "selected" is
          assumed -- that is, only the currently selected mailbox is searched.
        </t>

        <t>
          The "personal" source option is a particularly convenient way to search all of
          the current user's mailboxes.
          Note that there is no way to use wildcard characters to search all mailboxes;
          the "mailboxes" source option does not do wildcard expansion.
        </t>

        <t>
          If the source options include (or default to) "selected",
          the IMAP session MUST be in "selected" state.
          If the source options specify other mailboxes and NOT "selected",
          then the IMAP session MUST be in either "selected" or "authenticated" state.
          If the session is not in a correct state,
          the ESEARCH command MUST return a "BAD" result.
        </t>

        <t>
          If the server supports the SEARCHRES <xref target="RFC5182"/> extension, then
          the "SAVE" result option is valid *only* if "selected" is specified
          or defaulted as the sole mailbox to be searched.
          If any source option other than "selected" is specified,
          the ESEARCH command MUST return a "BAD" result.
        </t>
	  
	  
        <t>
          If the server supports the CONTEXT=SEARCH and/or CONTEXT=SORT extension
          <xref target="RFC5267"/>, then the following additional rules apply:
          <list style="symbols">
            <t>
              The CONTEXT return option (Section 4.2 of <xref target="RFC5267"/>) can be used
              with an ESEARCH command.
            </t>
		
            <t>
              If the UPDATE return option is used (Section 4.3 of <xref target="RFC5267"/>),
              it MUST apply ONLY to the the currently selected mailbox.
              If UPDATE is used and there is no mailbox currently selected,
              the ESEARCH command MUST return a "BAD" result.
            </t>

            <t>
              The PARTIAL search return option (Section 4.4 of <xref target="RFC5267"/>)
              can be used and applies to each mailbox searched by the ESEARCH command.
            </t>
          </list>
        </t>

        <t>
          If the server supports the ACL <xref target="RFC4314"/> extension, then
          the logged in user is required to have the 'r' right for each mailbox
          she wants to search.
          In addition, any mailboxes that are not explicitly named
          (accessed through "personal" or "subtree", for example) are required
          to have the "l" right.
          Mailboxes matching the source options for which
          the logged in user lacks sufficient rights MUST be ignored
          by the ESEARCH command processing.
          In particular, ESEARCH responses MUST NOT be returned for those mailboxes.
        </t>
      </section>

    </section>

    <section anchor="examples" title="Examples">
      <t>
        In the following example, note that two ESEARCH commands are pipelined,
        and that the server is running them in parallel, interleaving a response
        to the second search amid the responses to the first (watch the tags).
      </t>
      <t>
        C: tag1 ESEARCH IN (mailboxes "folder1" subtree "folder2") unseen<vspace/>
        C: tag2 ESEARCH IN (mailboxes "folder1" subtree-one "folder2") subject "chad"<vspace/>
        S: * ESEARCH (TAG "tag1" MAILBOX "folder1" UIDVALIDITY 1) UID ALL 4001,4003,4005,4007,4009<vspace/>
        S: * ESEARCH (TAG "tag2" MAILBOX "folder1" UIDVALIDITY 1) UID ALL 3001:3004,3788<vspace/>
        S: * ESEARCH (TAG "tag1" MAILBOX "folder2/banana" UIDVALIDITY 503) UID ALL 3002,4004<vspace/>
        S: * ESEARCH (TAG "tag1" MAILBOX "folder2/peach" UIDVALIDITY 3) UID ALL 921691<vspace/>
        S: tag1 OK done<vspace/>
        S: * ESEARCH (TAG "tag2" MAILBOX "folder2/salmon" UIDVALIDITY 1111111) UID ALL 50003,50006,50009,50012<vspace/>
        S: tag2 OK done
      </t>
    </section>

    <section anchor="syntax" title="Formal Syntax">
      <t>
        The following syntax specification uses the Augmented Backus-Naur Form (ABNF)
        as described in <xref target="RFC5234"/>.
        Terms not defined here are taken from <xref target="RFC3501"/>, <xref target="RFC5465"/>,
        or <xref target="RFC4466"/>.
      </t>
      <t>
        <list style="hanging" hangIndent="8">

        <t hangText="command-auth =/">esearch<vspace/>
            ; Update definition from IMAP base <xref target="RFC3501"/><vspace/>
            ; Add new "esearch" command.
        </t>

        <t hangText="command-select =/">esearch<vspace/>
            ; Update definition from IMAP base <xref target="RFC3501"/><vspace/>
            ; Add new "esearch" command.
        </t>

        <t hangText="filter-mailboxes-other =/">("subtree-one" SP one-or-more-mailbox)<vspace/>
            ; Update definition from IMAP Notify <xref target="RFC5465"/><vspace/>
            ; Add new "subtree-one" selector.
        </t>

        <t hangText="filter-mailboxes-selected =">"selected"<vspace/>
            ; Update definition from IMAP Notify <xref target="RFC5465"/><vspace/>
            ; We forbid the use of "selected-delayed".
        </t>

        <t hangText="one-correlator =">("TAG" SP tag-string) /
            ("MAILBOX" SP astring) / ("UIDVALIDITY" SP nz-number)<vspace/>
            ; Each correlator MUST appear exactly once
        </t>

        <t hangText="scope-option =">scope-option-name [SP scope-option-value]<vspace/>
            ; No options defined here.  Syntax for future extensions.
        </t>

        <t hangText="scope-option-name =">tagged-ext-label<vspace/>
            ; No options defined here.  Syntax for future extensions.
        </t>

        <t hangText="scope-option-value =">tagged-ext-val<vspace/>
            ; No options defined here.  Syntax for future extensions.
        </t>

        <t hangText="scope-options =">scope-option *(SP scope-option)<vspace/>
            ; A given option may only appear once<vspace/>
            ; No options defined here.  Syntax for future extensions.
        </t>

        <t hangText="esearch =">"ESEARCH" [SP esearch-source-opts]<vspace />
            [SP search-return-opts] SP search-program<vspace />
        </t>

        <t hangText="search-correlator =">SP "(" one-correlator *(SP one-correlator) ")"<vspace/>
            ; Updates definition in IMAP4 ABNF <xref target="RFC4466"/>
        </t>

        <t hangText="esearch-source-opts =">"IN" SP "(" source-mbox [SP<vspace/>
            "(" scope-options ")"] ")"
        </t>

        <t hangText="source-mbox =">filter-mailboxes *(SP filter-mailboxes)<vspace/>
            ; filter-mailboxes is defined in IMAP Notify <xref target="RFC5465"/><vspace/>
            ; See updated definition of filter-mailboxes-other, above.<vspace/>
            ; See updated definition of filter-mailboxes-selected, above.
        </t>

      </list>
      </t>
    </section>

    <section anchor="security" title="Security Considerations">
      <t>
        This new IMAP ESEARCH command allows a single command to search many
        mailboxes at once.
        On the one hand, a client could do that by sending many IMAP SEARCH commands.
        On the other hand, this makes it easier for a client to overwork a
        server, by sending a single command that results in an expensive search
        of tens of thousands of mailboxes.
        Server implementations need to be aware of that, and provide mechanisms
        that prevent a client from adversely affecting other users.
        Limitations on the number of mailboxes that may be searched in one command, and/or
        on the server resources that will be devoted to responding to a single client,
        are reasonable limitations for an implementation to impose.
      </t>            
      <t>
        Implementations MUST, of course, apply access controls appropriately,
        limiting a user's access to ESEARCH in the same way its access is
        limited for any other IMAP commands.
        This extension has no data-access risks beyond what may be there in the unextended
        IMAP implementation.
      </t>            
      <t>
        Mailboxes matching the source options for which
        the logged in user lacks sufficient rights MUST be ignored
        by the ESEARCH command processing
        (see the paragraph about this in <xref target="source-options"/>).
        In particular,
        any attempt to distinguish insufficient access from non-existent mailboxes
        may expose information about the mailbox hierarchy that isn't otherwise
        available to the client.
      </t>
      <t>
        If "subtree" is specified, the server MUST defend against loops in the hierarchy
        (see the paragraph about this in <xref target="source-options"/>).
      </t>
    </section>

    <section anchor="iana" title="IANA Considerations">
      <t>
        IMAP4 capabilities are registered by publishing a standards track or
        IESG approved experimental RFC.  The registry is currently located here:
        <figure>
          <artwork>
            <![CDATA[
   http://www.iana.org/assignments/imap4-capabilities
            ]]>
          </artwork>
        </figure>
        This document defines the IMAP capability "MULTISEARCH"
        and IANA is asked to add it to the registry.
      </t>
    </section>
    
    <section title="Acknowledgements">
      <t>
        The authors gratefully acknowledge feedback provided by Timo Sirainen, Peter Coates
        and Arnt Gulbrandsen.
      </t>
    </section>    
  </middle>

  <back>
    <references title="Normative References">
      &RFC2119; <!-- Kwds -->
      &RFC2978; <!-- CHARSET -->
      &RFC3501; <!-- IMAP -->
      &RFC4314; <!-- IMAP ACL -->
      &RFC4466; <!-- IMAP4 ABNF -->
      &RFC4731; <!-- IMAP ESEARCH -->
      &RFC5182; <!-- IMAP SEARCHRES -->
      &RFC5234; <!-- ABNF -->
      &RFC5267; <!-- IMAP CONTEXT -->
      &RFC5465; <!-- IMAP NOTIFY -->
    </references>
  </back>
</rfc>

