联系方式

  • QQ:99515681
  • 邮箱:99515681@qq.com
  • 工作时间:8:00-21:00
  • 微信:codinghelp

您当前位置:首页 >> C/C++编程C/C++编程

日期:2020-12-06 06:36

Before you turn this problem in, make sure everything runs as expected. First, restart the kernel (in

the menubar, select Kernel Restart) and then run all cells (in the menubar, select Cell Run All).

Make sure you ll

in any place that says YOUR CODE HERE or "YOUR ANSWER HERE", as well as your

name and collaborators below:

→ →

NAME = ""

COLLABORATORS = ""

Copyright Luca de Alfaro, 2019-20. License: CC-BY-NC-ND.

Homework 11: Scheduling with Dependencies

Please submit to this Google Form.

Deadline: Friday December 4, 11pm (check on Canvas for updated information).

Submission

This test contains 4 questions, for a total of 90 points.

Test Format

Assume you have to prepare Pasta Carbonara. My version of the recipe goes like this:

Dice onions and pancetta, and fry in a mix of olive oil and butter, slowly. Separately,

put in a bowl as many eggs as there are dinner guests; you can either put in the bowls

the yolks only, or you can add a few whites if you wish. Beat the eggs.

Bring water to a boil, and when it boils, salt it. Put the pasta in (I like Penne Rigate).

When cooked, colander the water away, and quickly unite in the bowl the beaten eggs,

the pasta, and the pancetta. Mix well and serve immediately.

If you have to invite people over, you could do this recipe sequentially, and rst

worry about cooking

the pasta: warming the water, putting the pasta in, then colandering it. Then you could worry about

cooking the pancetta and onions. When that's done, you can start to beat the eggs. Finally, you

could unite everything. Technically, that would work, but there would be two problems. The rst

is

that, of course, the pasta would be rather cold by the time it would be served, a capital sin (pasta

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 2/19

must be served immediately after it is cooked). Secondly, even if you rehash the order so that you

rst

cook the pancetta, then beat the eggs, then cook the pasta, then technically this works -- but it

would take you well over one hour to have everything ready. You want to do things in parallel,

cooking the pancetta while heating up the water for the pasta, and so forth. You want to discover

what are the things that need to be done one after the other, and what are the things that can be

done in parallel, and in which order to do everything.

Great cooking, by the way, is much about the perfect timing, not only the perfect preparation. You

have to have the various preparations ready at the same time, to unite them just right. We will worry

about timing in the second part of this chapter; rst,

we worry about what we can do and in which

order.

As an aside for those of you who are more interested in compiling code than in cooking, the

problem of how to compile C or C++ code is very similar. A makele

denes

dependencies between

tasks: you have to have compiled pathlib.c before you can link the result together with something

else. The task of the make program is to gure

out how to parallelize the compilation, so that

independent tasks can happen in different processes (possibly on different CPU cores), while

respecting the precedence constraints between tasks. We will mention this application in some of

the exercises of the chapter.

Scheduling dependent tasks

We rst

disregard the problem of cooking (or compiling) time, and ask about the order in which we

should be doing the tasks. We want to create a Scheduler object, that can tell us what to do at the

same time. What operations should this object support?

add_task: we should be able to add a task, along with the task dependencies.

reset: indicating that we are about to run the sequences of tasks again.

available_tasks: this property should return the set of things that we can do in parallel.

mark_completed: used to notify the scheduler that we have completed a task. This should

return the set of new tasks that we can do due to this task being completed; we can do these

tasks in parallel alongside with the others that we are already doing.

all_done: returns True/False according to whether we have completed all tasks.

Choosing these operations is perhaps the most important step in the design of the scheduler. The

operations need to have a simple, clear denition,

and be useful in a concrete implementation of the

service which will run the tasks. Of the above operations, they are all uncontroversial, except for the

choice of behavior of completed. In theory, there is no need for completed to return the set of new

tasks that can now be undertaken. If one remembers the set of tasks one can a do before a task

is completed, and marks as completed, one can simply ask the scheduler for the set of

T1

t ∈ T1 t

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 3/19

tasks that can now be done, and add those in for execution. However,

we guess (as we have not yet written the task execution engine) that being told this set of tasks

directly will simplify the design of the task execution engine.

T2 T21t = T2 ∖ ({t} ∪ T1 )

Our scheduler class will be implemented in similar fashion to our graph class, with tasks

corresponding to graph vertices, and dependencies represented as edges. The difference is that

here, given a vertex (that is, a task) , it will be useful to be able to access both:

the predecessors of , that is, the tasks that are declared as prerequisites of , and

the successors of , that is, the tasks such that was declared as a prerequisite for .

When we add a task, we would have to initialize its set of successors and predecessors to empty.

This is somewhat tedious, and so we resort to a defaultdict, which is a special type of dictionary

such that, if the mapping for a key has not been dened,

it returns a default value; in our case, an

empty set. You can read more about defaultdict and related types here.

Our rst

implementation of the class is as follows. We let you complete the available_tasks and

mark_completed methods.

v

v u v

v u v u

from collections import defaultdict

import networkx as nx # Library for displaying graphs.

import matplotlib.pyplot as plt

class DependencyScheduler(object):

   def __init__(self):

       self.tasks = set()

       # The successors of a task are the tasks that depend on it, and can

       # only be done once the task is completed.

       self.successors = defaultdict(set)

       # The predecessors of a task have to be done before the task.

       self.predecessors = defaultdict(set)

       self.completed_tasks = set() # completed tasks

   def add_task(self, t, dependencies):

       """Adds a task t with given dependencies."""

       # Makes sure we know about all tasks mentioned.

       assert t not in self.tasks or len(self.predecessors[t]) == 0, "The task was

       self.tasks.add(t)

       self.tasks.update(dependencies)

       # The predecessors are the tasks that need to be done before.

       self.predecessors[t] = set(dependencies)

       # The new task is a successor of its dependencies.

       for u in dependencies:

           self.successors[u].add(t)

   def reset(self):

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 4/19

    ( )

       self.completed_tasks = set()

   @property

   def done(self):

       return self.completed_tasks == self.tasks

   def show(self):

       """We use the nx graph to display the graph."""

       g = nx.DiGraph()

       g.add_nodes_from(self.tasks)

       g.add_edges_from([(u, v) for u in self.tasks for v in self.successors[u]])

       node_colors = ''.join([('g' if v in self.completed_tasks else 'r')

                          for v in self.tasks])

       nx.draw(g, with_labels=True, node_color=node_colors)

       plt.show()

   @property

   def uncompleted(self):

       """Returns the tasks that have not been completed.

       This is a property, so you can say scheduler.uncompleted rather than

       scheduler.uncompleted()"""

       return self.tasks - self.completed_tasks

   def _check(self):

       """We check that if t is a successor of u, then u is a predecessor

       of t."""

       for u in self.tasks:

           for t in self.successors[u]:

               assert u in self.predecessors[t]

Question 1: implement available_tasks and mark_completed .

### Implementation of `available_tasks` and `mark_completed`.

def scheduler_available_tasks(self):

   """Returns the set of tasks that can be done in parallel.

   A task can be done if all its predecessors have been completed.

   And of course, we don't return any task that has already been

   completed."""

   r = set()

   for i in self.uncompleted:

       if self.predecessors[i].issubset(self.completed_tasks):

           r.add(i)

       elif self.predecessors[i] == set():

           r.add(i)

   return r

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 5/19

def scheduler_mark_completed(self, t):

   """Marks the task t as completed, and returns the additional

   set of tasks that can be done (and that could not be

   previously done) once t is completed."""

   r = set()

   self.completed_tasks.add(t)

   for i in self.successors[t]:

       if self.predecessors[i].issubset(self.completed_tasks):

           r.add(i)

   return r

DependencyScheduler.available_tasks = property(scheduler_available_tasks)

DependencyScheduler.mark_completed = scheduler_mark_completed

# Here is a place where you can test your code.

# YOUR CODE HERE

Let us check if this works.

# Let us ensure that nose is installed.

try:

   from nose.tools import assert_equal, assert_true

   from nose.tools import assert_false, assert_almost_equal

except:

   !pip install nose

   from nose.tools import assert_equal, assert_true

   from nose.tools import assert_false, assert_almost_equal

from nose.tools import assert_true, assert_false, assert_equal

s = DependencyScheduler()

s.add_task('a', ['b', 'c'])

s.add_task('b', ['c', 'e'])

s._check()

s.show()

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 6/19

/usr/local/lib/python3.6/dist-packages/networkx/drawing/nx_pylab.py:478: MatplotlibDeprecationWa

label=label,

We note that in the above drawing, the edges denote temporal succession, that is, an edge from

to means that must happen before . Let us execute the schedule manually.

c

a c a

Here are some tests for available_tasks and mark_completed .

### Simple tests. 5 points.

s = DependencyScheduler()

s.add_task('a', [])

assert_equal(s.available_tasks, {'a'})

### Slightly more complicated. 10 points.

s = DependencyScheduler()

s.add_task('a', ['b', 'c'])

s.add_task('b', ['c', 'e'])

assert_equal(s.available_tasks, {'e', 'c'})

s = DependencyScheduler()

s.add_task('a', ['b'])

s.add_task('b', ['a'])

assert_equal(s.available_tasks, set())

### Now, let's test `mark_completed`.  Simple tests first. 5 points.

s = DependencyScheduler()

s.add_task('a', [])

assert_equal(s.available_tasks, {'a'})

r = s.mark_completed('a')

assert_equal(r, set())

s = DependencyScheduler()

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 7/19

s.add_task('a', ['b'])

assert_equal(s.available_tasks, {'b'})

r = s.mark_completed('b')

assert_equal(r, {'a'})

### Slightly more complicated. 10 points.

s = DependencyScheduler()

s.add_task('a', ['b', 'c'])

assert_equal(s.available_tasks, {'b', 'c'})

r = s.mark_completed('b')

assert_equal(r, set())

assert_equal(s.available_tasks, {'c'})

r = s.mark_completed('c')

assert_equal(r, {'a'})

s = DependencyScheduler()

s.add_task('a', ['b', 'c'])

s.add_task('b', ['c', 'e'])

s.add_task('c', [])

assert_equal(s.available_tasks, {'c', 'e'})

r = s.mark_completed('e')

assert_equal(r, set())

r = s.mark_completed('c')

assert_equal(r, {'b'})

r = s.mark_completed('b')

assert_equal(r, {'a'})

r = s.mark_completed('a')

assert_equal(r, set())

assert_equal(s.available_tasks, set())

Here is an execution engine for our tasks with dependencies.

Executing the tasks

import random

def execute_schedule(s, show=False):

   s.reset()

   in_process = s.available_tasks

   print("Starting by doing:", in_process)

   while len(in_process) > 0:

       # Picks one random task to be the first to be completed.

       t = random.choice(list(in_process))

print("Completed:" t)

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 8/19

       print( Completed: , t)

       in_process = in_process - {t} | s.mark_completed(t)

       print("Now doing:", in_process)

       if show:

           s.show()

   # Have we done all?

   if not s.done:

       print("Error, there are tasks that could not be completed:", s.uncompleted)

Let's try it on our old schedule:

s = DependencyScheduler()

s.add_task('a', ['b', 'c'])

s.add_task('b', ['c', 'e'])

s._check()

s.show()

execute_schedule(s, show=True)

What happens if there is a loop?

s = DependencyScheduler()

s.add_task('a', ['b'])

s.add_task('b', ['a'])

s.add_task('c', ['a'])

execute_schedule(s)

Ok, this is reasonable! Let us now encode our Carbonara pasta recipe.

carbonara = DependencyScheduler()

# First, the part about cooking the pancetta.

carbonara.add_task('dice onions', [])

carbonara.add_task('dice pancetta', [])

carbonara.add_task('put oil and butter in pan', [])

carbonara.add_task('put pancetta in pan', ['dice pancetta'])

carbonara.add_task('put onions in pan', ['dice onions'])

carbonara.add_task('cook pancetta', ['put oil and butter in pan',

                                    'put pancetta in pan',

'put onions in pan'])

# Second, the part about beating the eggs.

carbonara.add task('put eggs in bowl', [])

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 9/19

_ ( p gg , [])

carbonara.add_task('beat eggs', ['put eggs in bowl'])

# Third, cooking the pasta.

carbonara.add_task('fill pot with water', [])

carbonara.add_task('bring pot of water to a boil', ['fill pot with water'])

carbonara.add_task('add salt to water', ['bring pot of water to a boil'])

carbonara.add_task('put pasta in water', ['bring pot of water to a boil',

                                        'add salt to water

carbonara.add_task('colander pasta', ['put pasta in water'])

# And finally, we can put everything together.

carbonara.add_task('serve', ['beat eggs', 'cook pancetta', 'colander pasta'])

# Let's look at our schedule!

carbonara.show()

# And let's finally prepare carbonara!

execute_schedule(carbonara)

This is not necessarily the best order of actions to prepare pasta carbonara, but it denitely

works

as a schedule.

Building a Better Execution Engine

Let us build a better execution engine for our schedules. Right now, we have a function:

def execute_schedule(s, show=False):

   s.reset()

   in_process = s.available_tasks

   print("Starting by doing:", in_process)

   while len(in_process) > 0:

       # Picks one random task to be the first to be completed.

       t = random.choice(list(in_process))

       print("Completed:", t)

       in_process = in_process - {t} | s.mark_completed(t)

       print("Now doing:", in_process)

       if show:

           s.show()

   # Have we done all?

   if not s.done:

       print("Error, there are tasks that could not be completed:", s.uncompleted)

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 10/19

We want to wrap these methods into a class, RunSchedule. This will allow us more exibility

in

executing a schedule, as we will be able to specify parameters that guide the execution policy,

interrupt and resume the execution, and so on.

An object of class RunSchedule is initialized with a DependencyScheduler. It then has the following

methods:

reset: mark all tasks as not completed.

step: perform one step in the schedule, completing a single task.

run: performs all steps in the schedule, until completion.

done: indicates that all tasks have been done.

What should these methods return? step will return the task executed, while run will return the

whole list of tasks, in the order in which they were done.

class RunSchedule(object):

   def __init__(self, scheduler):

       self.scheduler = scheduler

       self.in_process = None # Indicating, we don't know yet.

   def reset(self):

       self.scheduler.reset()

       self.in_process = None

   def step(self):

       """Performs a step, returning the task, if any, or None,

       if there is no step that can be done."""

       # If we don't know what steps are in process, we get them.

       if self.in_process is None:

           self.in_process = self.scheduler.available_tasks

       if len(self.in_process) == 0:

           return None

       t = random.choice(list(self.in_process))

       self.in_process = self.in_process - {t} | self.scheduler.mark_completed(t)

       return t

   @property

   def done(self):

       return self.scheduler.done

   def run(self):

       """Runs the scheduler from the current configuration to completion.

       You must call reset() first, if you want to run the whole schedule."""

       tasks = []

       while not self.done:

           t = self.step()

           if t is not None:

tasks.append(t)

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 11/19

               tasks.append(t)

       return tasks

We can run our pasta carbonara with this RunSchedule class:

runner = RunSchedule(carbonara)

runner.reset()

runner.run()

Let us pause for a moment and ask: did we really need to create a new class? Could we not have

done the above in the scheduler class?

This is debatable. The idea in keeping the two classes separate is clarity of goals:

The scheduler is concerned with what can be done next.

The runner is concerned with any practical constraint to the execution, and with the choice of

what, among the possible, is actually done.

We will have occasion below to rely on this division of concerns.

Code changes and rotten eggs

Imagine that you need to compile three programs, , , and then link together the results into

. Once this is done, you compile and , and link into . As the last step, you link the

two libraries and together, and produce . You do it once. Great. But now you realize

that you need to change . Do you have to start from scratch?

You may think, who cares, it's the CPU doing the work, not me. Fair enough, but there are some large

systems that take minutes, dozen of minutes, to compile. If you are compiling the linux kernel on a

low power CPU, it might take hours. Surely you don't want to redo everything from scratch!

So imagine you have the tasks in an intermediate state, with some being completed (possibly all of

them), and some not. You can now mark one of the tasks as incomplete, to signal you need to do it

again. What is the set of tasks that as a consequence should also be marked incomplete? If you

have two tasks and , with being a successor to , if is marked as "undone" as it needs to be

redone, then also will need to be redone, as it might use the results of .

To implement this, we will perform two modications.

First, we will endow our scheduler with a redo

method, which marks a task and all its successors to be redone -- that is, it unmarks them as done.

We let you implement this; you have already seen how to compute reachability in the graph chapter.

Code changes

a c

f. out d e g. out

g. out f. out h

b

x y y x x

y x

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 12/19

Question 2: redo for code

### Implementation of `redo`

def dependency_scheduler_redo(self, t):

   """Mark the task t, and all its successors, as undone.

   Returns the set of successor tasks of t, with t included."""

   # YOUR CODE HERE

DependencyScheduler.redo = dependency_scheduler_redo

# Here is a place where you can test your code.

# YOUR CODE HERE

Let us test the implementation.

### Tests for `redo` for code. 10 points.

s = DependencyScheduler()

s.add_task('a', [])

s.add_task('b', ['a'])

s.add_task('c', ['a'])

s.add_task('d', ['b', 'c'])

s.add_task('e', ['a', 'd'])

s.mark_completed('a')

s.mark_completed('b')

s.mark_completed('c')

assert_equal(s.available_tasks, {'d'})

s.redo('b')

assert_equal(s.available_tasks, {'b'})

Next, we implement a runner that has an additional operation redo(t) for a task t.

def run_schedule_redo(self, t):

   """Marks t as to be redone."""

   # We drop everything that was in progress.

   # This also forces us to ask the scheduler for what to redo.

   self.in_process = None

   return self.scheduler.redo(t)

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 13/19

RunSchedule.redo = run_schedule_redo

We can now play with it.

runner = RunSchedule(carbonara)

runner.reset()

for _ in range(10):

   print(runner.step())

print("---> readd salt")

print("marking undone:", runner.redo("add salt to water"))

print("completed:", runner.scheduler.completed_tasks)

for _ in range(10):

   print(runner.step())

print("--->redo dice pancetta")

print("marking undone:", runner.redo("dice pancetta"))

print("completed:", runner.scheduler.completed_tasks)

for t in runner.run():

   print(t)

You have learned to sequence the order in which to do tasks so as to respect their dependencies. In

the next chapter, we will learn how to also take into account the time it takes for us to do the tasks.

In the meantime, bon appetit, or rather, guten appetit, or rather, buon appetito!

The act of redoing a cooking step is somewhat different than the act of redoing something in code.

Suppose you cook pasta, unite with it the fried bacon and onions, and then -- terrible mishap -- you

unite with it the beaten egg yolks in which one of the eggs is rotten.

In code, when one le

changes, you only need to redo the things that depend on that le.

In cooking,

it is different: even if nothing changed in the bacon, onions, and cooked pasta, once you add to it

rotten eggs you have to redo the pasta, bacon, onions, etc, as well, as they have now been

contaminated. The root of the problem is that in a makele,

when you combine two les

to compute

a result, you do not destroy the original les,

whereas in cooking, once you combine foods, you don't

have the original foods any longer. Cooking is like a makele

in which, once you combine les,

you

immediately delete them.

So let us come up with a precise denition

of what needs to be redone in cooking, when one of the

steps goes bad (the eggs are rotten, you burn the food on the stove, and so on).

Initially, we label redo the task that needs redoing. We then propagate the label according to these

two rules:

Redoing in cooking

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 14/19

If a task is labeled redo, if is a successor of and is completed, then is also labeled

redo.

If a task is labeled redo, and if is a predecessor of , then is also labeled redo (note that

in this case, we are guaranteed that is completed).

The rst

rule corresponds to a forward pass in the dependency garph; the second rule corresponds

to a backward pass in the dependency relation. Once the redo label is propagated, all tasks that are

marked redo are changed from completed, to uncompleted.

We ask you to implement this in code.

v u v u u

v u v u

u

Question 3: redo for recipes

### Implementation of `cooking_redo`

def dependency_scheduler_cooking_redo(self, v):

   """Indicates that the task v needs to be redone, as something went bad.

   This is the "cooking" version of the redo, in which the redo propagates

   to both successors (as for code) and predecessors."""

   # YOUR CODE HERE

DependencyScheduler.cooking_redo = dependency_scheduler_cooking_redo

# Here is a place where you can test your code.

# YOUR CODE HERE

Let us check that the code works. First, a simple example.

### Basic tests for `cooking_redo`. 10 points.

s = DependencyScheduler()

s.add_task('a', [])

s.add_task('b', [])

s.add_task('c', ['a', 'b'])

s.add_task('d', ['c', 'a'])

s.add_task('e', [])

s.add_task('f', ['e'])

s.add_task('g', ['f', 'd'])

s.mark_completed('a')

s.mark_completed('b')

s.mark_completed('c')

s mark completed('d')

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 15/19

s.mark_completed( d )

assert_equal(s.available_tasks, {'e'})

s.cooking_redo('c')

# When we redo c, both its successor d, and predecessors a, b have to be redone.

assert_equal(s.available_tasks, {'a', 'b', 'e'})

assert_equal(s.completed_tasks, set())

And now, some slightly more sophisticated tests.

### Advanced tests for `cooking_redo`. 10 points.

s = DependencyScheduler()

s.add_task('a', [])

s.add_task('b', [])

s.add_task('c', ['a', 'b'])

s.add_task('d', ['c', 'a'])

s.add_task('e', [])

s.add_task('f', ['e'])

s.add_task('g', ['f', 'd'])

s.mark_completed('a')

s.mark_completed('b')

s.mark_completed('c')

s.mark_completed('d')

s.mark_completed('e')

assert_equal(s.available_tasks, {'f'})

s.cooking_redo('c')

# When we redo c, both its successor d, and predecessors a, b have to be redone.

assert_equal(s.available_tasks, {'a', 'b', 'f'})

assert_equal(s.completed_tasks, {'e'})

In the schedules we have seen so far, the dependencies are in and one with the other: if a task

depends on , then both and need to be completed before can be started. It is possible to

consider also cases where dependencies are in an _or relation: if depends on in an or way,

then it suces

to complete one of or before starting . For instance, in our Carbonara Pasta

example, it is possible (even though not necessarily advisable) to use shallots in place of onions. In

that case, instead of

carbonara.add_task('put onions in pan', ['dice onions'])

Question 4: Implement And-Or Schedules

a

b, c b c a

a b, c

b c a

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 16/19

we could have:

carbonara.add_or_task('put onions in pan', ['dice onions', 'dice shallots'])

so that before putting the (now generally named) onions in a pan, we could choose to dice either

shallots or onions.

Formally, the idea is to endow the Scheduler class with two methods:

add_and_task(self, t, dependencies) adds a task t with list of dependencies dependencies , so

that t can be done when all of the dependencies are done. The task t is called an AND node

in the dependency graph.

add_or_task(self, t, dependencies) adds a task t with list of dependencies dependencies , so

that t can be done when at least one of the dependencies is done. The task t is called an OR

node in the dependency graph.

You need to nd

a way to remember which dependency graph nodes are AND or OR nodes, and you

need to implement the properties done , available_tasks , uncompleted , and the method

mark_completed , to make this class work. Implementing the show method is optional; do it if it helps

you debug your code.

### `AND_OR_Scheduler` implementation

class AND_OR_Scheduler(object):

   def __init__(self):

       # It is up to you to implement the initialization.

       # YOUR CODE HERE

   def add_and_task(self, t, dependencies):

       """Adds an AND task t with given dependencies."""

       # YOUR CODE HERE

   def add_or_task(self, t, dependencies):

       """Adds an OR task t with given dependencies."""

       # YOUR CODE HERE

   @property

   def done(self):

       # YOUR CODE HERE

   @property

   def available_tasks(self):

       """Returns the set of tasks that can be done in parallel.

       A task can be done if:

       - It is an AND task, and all its predecessors have been completed, or

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 17/19

       - It is an OR task, and at least one of its predecessors has been comple

       And of course, we don't return any task that has already been

       completed."""

       # YOUR CODE HERE

   def mark_completed(self, t):

       """Marks the task t as completed, and returns the additional

       set of tasks that can be done (and that could not be

       previously done) once t is completed."""

       # YOUR CODE HERE

   def show(self):

       """You can use the nx graph to display the graph.  You may want to ensur

       that you display AND and OR nodes differently."""

       # YOUR CODE HERE

# Here is a place where you can test your code.

# YOUR CODE HERE

Let us do some simple tests. First, for good old AND nodes.

### Simple tests for AND nodes. 10 points.

s = AND_OR_Scheduler()

s.add_and_task('a', ['b', 'c'])

assert_equal(s.available_tasks, {'b', 'c'})

r = s.mark_completed('b')

assert_equal(r, set())

assert_equal(s.available_tasks, {'c'})

r = s.mark_completed('c')

assert_equal(r, {'a'})

assert_equal(s.available_tasks, {'a'})

r = s.mark_completed('a')

assert_equal(r, set())

assert_equal(s.available_tasks, set())

Then, some simple tests for OR nodes.

### Simple tests for OR nodes. 10 points.

s = AND_OR_Scheduler()

s.add_or_task('a', ['b', 'c'])

assert_equal(s.available_tasks, {'b', 'c'})

r = s.mark completed('b')

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 18/19

_ p ( )

# Now 'a' becomes available.

assert_equal(r, {'a'})

# But note that 'c' is also available, even if useless.

assert_equal(s.available_tasks, {'a', 'c'})

r = s.mark_completed('a')

assert_equal(r, set())

assert_equal(s.available_tasks, {'c'})

r = s.mark_completed('c')

assert_equal(r, set())

assert_equal(s.available_tasks, set())

Note that a drawback of this simple solution, as illustrated by the above test case, is that we do not

distinguish between the tasks that are useful to do the root task, and the tasks that are useless, that

is, not part of a minimal solution. We simply call them available, as they can be done, even though

there is no advantage in doing them.

### Tests with both AND and OR nodes. 10 points.

s = AND_OR_Scheduler()

s.add_and_task('a', ['b', 'c'])

s.add_or_task('b', ['b1', 'b2'])

s.add_or_task('c', ['c1', 'c2'])

r = s.mark_completed('b1')

assert_equal(s.available_tasks, {'b', 'b2', 'c1', 'c2'})

r = s.mark_completed('b')

assert_false('a' in s.available_tasks)

r = s.mark_completed('c1')

assert_false('a' in s.available_tasks)

r = s.mark_completed('c')

assert_true('a' in s.available_tasks)

s = AND_OR_Scheduler()

s.add_or_task('a', ['b', 'c'])

s.add_and_task('b', ['b1', 'b2'])

s.add_and_task('c', ['c1', 'c2'])

r = s.mark_completed('b1')

assert_equal(s.available_tasks, {'b2', 'c1', 'c2'})

r = s.mark_completed('c1')

assert_equal(s.available_tasks, {'b2', 'c2'})

r = s.mark_completed('c2')

assert_equal(s.available_tasks, {'b2', 'c'})

r = s.mark_completed('c')

assert_true('a' in s.available_tasks)

2020/12/2 “Homework_11_Scheduling_with_Dependencies.ipynb”的副本 - Colaboratory

https://colab.research.google.com/drive/1KYm1ULxx2408rSNwPr4ajC2qNhZ9FkeH?authuser=1#scrollTo=LpjAWZpXSUOQ&printMode=true 19/19


版权所有:编程辅导网 2021 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。 站长地图

python代写
微信客服:codinghelp