4871 Commits

Author SHA1 Message Date
Adin Scannell 0fb5de1154 Use a stable ordering for generated types.
Otherwise this pollutes the 'go' branch and doesn't conform to standards
for generated bazel files.

PiperOrigin-RevId: 349605037
2020-12-30 14:53:03 -08:00
Fabricio Voznika 1b66bad7c4 Fix condition checking in `runsc debug`
Closes #5052

PiperOrigin-RevId: 349579814
2020-12-30 11:18:36 -08:00
Adin Scannell ed5850e8eb Add continuous VFS1 tests.
PiperOrigin-RevId: 349517093
2020-12-29 23:28:39 -08:00
Ting-Yu Wang fc153750ed Deflake TestBindOverlay in //test/e2e:integration_test
Test failure happens when `echo` writes late:
* Server nc shuts down write when it sees EOF from stdin.
* Client nc closes the connection when it detects EOF from socket and no data
  in internal write buffer.

Use the `-q` flag so that the server does not shut down write at the beginning,
while letting the connection be closed by the client. As the default value of
`-q` varies across netcat versions, we always specify it to prevent future breakage.

PiperOrigin-RevId: 349504016
2020-12-29 20:24:40 -08:00
Adin Scannell ffa9a715aa Simplify profiling and benchmarks.
- Tweak the benchmarks to work with b.N where appropriate. In many cases,
  b.N was simply being ignored. This creates an implicit dependency on the
  user passing a reasonable benchtime (less than or equal to the actual
  runtime of the test, or using the X syntax) otherwise the test runs
  forever.
- In cases where the above is impossible, explicitly set benchtime from
  the test wrapper, to prevent the above behavior (tensorflow).
- Drop the *Reverse variants, which are simply hey benchmarks. We should
  just add a hey benchmark. The platforms benchmarks already include a
  native platform, and thus these benchmarks are incredibly confusing.
  (In other words, BenchmarkNginxReverse has nothing to do with an nginx
  benchmark for runsc.)
- Remove the redundant Harness object, which contains no state, in order
  to slightly simplify the code.
- Make Block and Heap profiling actually work, by setting appropriate
  runtime parameters (and plumbing them through the config).
- Split the profiling into two phases: start and stop, since some will
  need to be started early, and others will need to happen at the end.

PiperOrigin-RevId: 349495377
2020-12-29 18:29:12 -08:00
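
The b.N point above can be illustrated with a minimal sketch. This is not code
from the commit: `work` and `BenchmarkWork` are hypothetical names, and the
example only shows the general shape of a Go benchmark that scales with b.N so
that the framework, not the benchmark body, decides how long to run.

```go
package main

import (
	"fmt"
	"strings"
	"testing"
)

// work is a hypothetical stand-in for the operation being measured.
func work() int {
	return len(strings.Repeat("x", 64))
}

// BenchmarkWork repeats the operation b.N times. The testing package picks
// b.N so that the run lasts roughly the requested benchtime.
func BenchmarkWork(b *testing.B) {
	for i := 0; i < b.N; i++ {
		work()
	}
}

func main() {
	// Init registers the testing flags; it is needed when calling
	// testing.Benchmark outside of `go test` and is a no-op otherwise.
	testing.Init()
	res := testing.Benchmark(BenchmarkWork)
	fmt.Println("iterations:", res.N)
}
```

Under `go test`, a fixed iteration count can be requested with
`-benchtime=100x`, which is presumably the "X syntax" the message refers to.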
Adin Scannell 3c58405a54 Exclude changing generated file from go_branch.
PiperOrigin-RevId: 349491589
2020-12-29 17:46:00 -08:00
Jing Chen d302c05700 Deflake semaphore_test_native
PiperOrigin-RevId: 349490873
2020-12-29 17:38:23 -08:00
Adin Scannell 85c1c3ed4b Make profiling commands synchronous.
This allows for a model of profiling where you can start collection, and
it will terminate when the sandbox terminates. Without this synchronous
call, it is effectively impossible to collect lengthy blocking and mutex
profiles.

PiperOrigin-RevId: 349483418
2020-12-29 16:23:01 -08:00
Tamir Duberstein 91c05c609e Export socket_ipv6_udp_unbound_loopback.cc
PiperOrigin-RevId: 349458984
2020-12-29 12:56:59 -08:00
Tamir Duberstein 3ff7324dfa Avoid duplicate `run` key
This GitHub action has been failing since 4e963c9.

PiperOrigin-RevId: 349327675
2020-12-28 13:57:54 -08:00
gVisor bot 5dc60a7632 Merge pull request #5166 from kevinGC:bk-warnings
PiperOrigin-RevId: 349314945
2020-12-28 12:12:35 -08:00
Kevin Krakauer 5259b90dac Resolve compiler warnings
The buildkite syscall tests start with a lot of C++ warnings. This
cleans that up a little.
2020-12-23 16:49:01 -08:00
Andrei Vagin d079159876 vfs1: don't allow to open socket files
open() has to return ENXIO in this case.

O_PATH isn't supported by vfs1.

PiperOrigin-RevId: 348820478
2020-12-23 11:11:07 -08:00
gVisor bot 6d96a2394d Internal change.
PiperOrigin-RevId: 348720223
2020-12-22 17:46:41 -08:00
Kevin Krakauer 3a5c2cf42c Add missing link types to go branch
PiperOrigin-RevId: 348710397
2020-12-22 16:21:03 -08:00
Nayana Bidari 7c8ba72b02 Move SO_BINDTODEVICE to socketops.
PiperOrigin-RevId: 348696094
2020-12-22 14:44:02 -08:00
Tamir Duberstein 202e9fa369 Correctly log sniffed ARP packets
This condition was inverted in 360006d.

PiperOrigin-RevId: 348679088
2020-12-22 12:52:19 -08:00
Peter Johnston fee2cd640f Invoke address resolution upon subsequent traffic to Failed neighbor
Removes the period of time in which subsequent traffic to a Failed neighbor
immediately fails with ErrNoLinkAddress. A Failed neighbor is one in which
address resolution fails; or in other words, the neighbor's IP address cannot
be translated to a MAC address.

This means removing the Failed state for linkAddrCache and allowing transitions
out of Failed into Incomplete for neighborCache. Previously, both caches would
transition entries to Failed after address resolution fails. In this state, any
subsequent traffic requested within an unreachable time would immediately fail
with ErrNoLinkAddress. This does not follow RFC 4861 section 7.3.3:

  If address resolution fails, the entry SHOULD be deleted, so that subsequent
  traffic to that neighbor invokes the next-hop determination procedure again.
  Invoking next-hop determination at this point ensures that alternate default
  routers are tried.

The API for getting a link address for a given address, whether through the link
address cache or the neighbor table, is updated to optionally take a callback
which will be called when address resolution completes. This allows `Route` to
handle completing link resolution internally, so callers of (*Route).Resolve
(e.g. endpoints) don’t have to keep track of when it completes and update the
Route accordingly.

This change also removes the wakers from LinkAddressCache, NeighborCache, and
Route in favor of the callbacks, and callers that previously used a waker can
now just pass a callback to (*Route).Resolve that will notify the waker on
resolution completion.

Fixes #4796

Startblock:
  has LGTM from sbalana
  and then
  add reviewer ghanan
PiperOrigin-RevId: 348597478
2020-12-22 01:37:05 -08:00
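
The callback-based resolution described above might look roughly like the
following sketch. It is a simplified illustration, not the netstack API: the
`resolver` type and its `get` and `complete` methods are hypothetical. A failed
resolution is not cached, so later traffic starts resolution again.

```go
package main

import (
	"fmt"
	"sync"
)

// resolver is a hypothetical, simplified link address cache.
type resolver struct {
	mu      sync.Mutex
	done    map[string]string                      // IP -> resolved MAC
	pending map[string][]func(mac string, ok bool) // IP -> waiting callbacks
}

func newResolver() *resolver {
	return &resolver{
		done:    make(map[string]string),
		pending: make(map[string][]func(string, bool)),
	}
}

// get returns the cached link address if there is one. Otherwise it queues
// the optional onResolve callback and reports a miss.
func (r *resolver) get(ip string, onResolve func(mac string, ok bool)) (string, bool) {
	r.mu.Lock()
	defer r.mu.Unlock()
	if mac, ok := r.done[ip]; ok {
		return mac, true
	}
	if onResolve != nil {
		r.pending[ip] = append(r.pending[ip], onResolve)
	}
	return "", false
}

// complete records the outcome and notifies every waiting caller. Failures
// are not stored, so the next get for this IP can trigger a fresh attempt.
func (r *resolver) complete(ip, mac string, ok bool) {
	r.mu.Lock()
	cbs := r.pending[ip]
	delete(r.pending, ip)
	if ok {
		r.done[ip] = mac
	}
	r.mu.Unlock()
	// Callbacks run outside the lock so they may call back into the resolver.
	for _, cb := range cbs {
		cb(mac, ok)
	}
}

func main() {
	r := newResolver()
	r.get("fe80::1", func(mac string, ok bool) {
		fmt.Println("resolved:", mac, ok)
	})
	r.complete("fe80::1", "02:00:00:00:00:01", true)
}
```

A caller that previously blocked on a waker could pass a callback that
signals that waker instead.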
Ghanan Gowripalan 620de250a4 Prefer matching labels and longest matching prefix
...when performing source address selection for IPv6.

These are defined in RFC 6724 section 5 rule 6 (prefer matching label)
and rule 8 (use longest matching prefix).

This change also considers ULA of global scope instead of its own scope,
as per RFC 6724 section 3.1:

   Also, note that ULAs are considered as global, not
   site-local, scope but are handled via the prefix policy table as
   discussed in Section 10.6.

Test: stack_test.TestIPv6SourceAddressSelectionScope

Startblock:
  has LGTM from peterjohnston
  and then
  add reviewer brunodalbo
PiperOrigin-RevId: 348580996
2020-12-21 22:26:10 -08:00
Andrei Vagin 946cb909e6 Don't modify a packet header when it can be used by other endpoints
Reported-by: syzbot+48c43f82fe7738fceae9@syzkaller.appspotmail.com
PiperOrigin-RevId: 348540796
2020-12-21 15:50:33 -08:00
Kevin Krakauer 981faa2c12 RLock Endpoint in raw.Endpoint.HandlePacket
PiperOrigin-RevId: 348530530
2020-12-21 14:44:39 -08:00
Etienne Perot 9a72730f24 Typo fix.
PiperOrigin-RevId: 348106699
2020-12-17 15:39:03 -08:00
Chong Cai 433fd0e646 Set verityMu to be state nosave
PiperOrigin-RevId: 348092999
2020-12-17 14:23:02 -08:00
gVisor bot a0209de301 Internal change.
PiperOrigin-RevId: 348089449
2020-12-17 14:05:01 -08:00
Fabricio Voznika eeee055d60 Set --nocache_test_results to runtime tests
If not set, the cached result is used even when runtime options
are changed, because they are not visible to blaze/bazel.

PiperOrigin-RevId: 348074339
2020-12-17 12:42:09 -08:00
Nicolas Lacasse 1ea241e4cc Fix seek on /proc/pid/cmdline when task is zombie.
PiperOrigin-RevId: 348056159
2020-12-17 11:16:06 -08:00
Ayush Ranjan 028271b530 [netstack] Implement IP(V6)_RECVERR socket option.
PiperOrigin-RevId: 348055514
2020-12-17 11:10:41 -08:00
Fabricio Voznika 30860902f6 Set process group and session on host TTY
Closes #5128

PiperOrigin-RevId: 348052446
2020-12-17 10:58:18 -08:00
Fabricio Voznika 8ea19b5818 Add sandbox ID to state file name
This allows finding all containers inside a sandbox more efficiently.
This operation is required every time a container starts and stops,
and previously required loading *all* container state files to check
whether the container belonged to the sandbox.

Apart from being inefficient, it has caused problems when state files
are stale or corrupt, making it impossible to create any container.

Also adjust commands `list` and `debug` to skip over files that fail
to load.

Resolves #5052

PiperOrigin-RevId: 348050637
2020-12-17 10:52:44 -08:00
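
The idea of encoding the sandbox ID in the state file name can be sketched as
follows. The naming scheme used here (`<containerID>_sandbox:<sandboxID>.state`)
and both helper functions are assumptions made for illustration and may not
match what runsc actually writes. The point is that membership can be decided
from the file name alone, without loading any state file.

```go
package main

import (
	"fmt"
	"strings"
)

// stateFileName builds a file name under the assumed naming scheme.
func stateFileName(sandboxID, containerID string) string {
	return containerID + "_sandbox:" + sandboxID + ".state"
}

// containersInSandbox returns the container IDs whose state file names carry
// the given sandbox ID. No file contents are read.
func containersInSandbox(names []string, sandboxID string) []string {
	suffix := "_sandbox:" + sandboxID + ".state"
	var ids []string
	for _, n := range names {
		if strings.HasSuffix(n, suffix) {
			ids = append(ids, strings.TrimSuffix(n, suffix))
		}
	}
	return ids
}

func main() {
	names := []string{
		stateFileName("sb1", "c1"),
		stateFileName("sb2", "c2"),
		stateFileName("sb1", "c3"),
	}
	fmt.Println(containersInSandbox(names, "sb1"))
}
```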
Fabricio Voznika e7493a9e23 Set max memory not min
Closes #5048

PiperOrigin-RevId: 348050472
2020-12-17 10:46:47 -08:00
Ayush Ranjan 74788b1b61 [netstack] Implement MSG_ERRQUEUE flag for recvmsg(2).
Introduces the per-socket error queue and the necessary cmsg mechanisms.

PiperOrigin-RevId: 348028508
2020-12-17 08:47:24 -08:00
Tamir Duberstein 4640fc4f35 Remove duplicate `return`
PiperOrigin-RevId: 347974624
2020-12-17 00:40:33 -08:00
Ghanan Gowripalan c740865f86 Cleanup locking in multicast group protocol tests
Startblock:
  has LGTM from asfez
  and then
  add reviewer tamird
PiperOrigin-RevId: 347928471
2020-12-16 17:26:06 -08:00
gVisor bot 0ac6636aaf Automated rollback of changelist 346565589
PiperOrigin-RevId: 347911316
2020-12-16 15:41:03 -08:00
gVisor bot 2ec6e44c9e Merge pull request #4880 from lubinszARM:pr_tlbi_02
PiperOrigin-RevId: 347890782
2020-12-16 13:52:58 -08:00
Nayana Bidari 0c92b3782a Add support to count the number of packets SACKed.
sacked_out is required in RACK to check the number of duplicate
acknowledgements while updating the reorder window. If there is no reordering
and the value for sacked_out is greater than the classic threshold value 3,
then reorder window is set to zero.
It is calculated by counting the number of segments sacked in the ACK and is
reduced when a cumulative ACK is received which covers the SACK blocks. This
value is set to zero when the connection enters recovery.

PiperOrigin-RevId: 347872246
2020-12-16 12:19:21 -08:00
Adin Scannell 7da25e6dc3 Restore refresh target.
PiperOrigin-RevId: 347864621
2020-12-16 11:45:28 -08:00
Mithun Iyer b645fcd241 Ensure correctness of saved receive window
When the scaled receive window size > 65535 (max uint16), we advertise
the scaled value as 65535, but are not adjusting the saved receive
window value when doing so. This would keep our current window
calculation logic to be incorrect, as the saved receive window value
is different from what was advertised.

Fixes #4903

PiperOrigin-RevId: 347771340
2020-12-16 00:24:28 -08:00
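
A minimal sketch of the fix described above, under the assumption that the
saved window should always equal what the peer can reconstruct from the
advertised 16-bit field. The `receiver` type, its `savedWnd` field, and the
`advertise` method are hypothetical and are not the netstack code.

```go
package main

import "fmt"

// maxAdvertised is the largest value that fits the 16-bit TCP window field.
const maxAdvertised = 0xffff

// receiver is a hypothetical holder for the saved receive window.
type receiver struct {
	savedWnd uint32 // window implied by the last advertisement, in bytes
}

// advertise returns the value for the TCP header and saves the window that
// the peer will derive from it, including when the value had to be clamped.
func (r *receiver) advertise(wnd uint32, wndScale uint8) uint16 {
	scaled := wnd >> wndScale
	if scaled > maxAdvertised {
		scaled = maxAdvertised
	}
	// Save what was actually advertised, not the window we wanted to offer.
	r.savedWnd = scaled << wndScale
	return uint16(scaled)
}

func main() {
	var r receiver
	adv := r.advertise(1<<20, 2)
	fmt.Println(adv, r.savedWnd)
}
```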
Ghanan Gowripalan c55e5bda4d Validate router alert's data length
RFC 2711 specifies that the router alert's length field is always 2
so we should make sure only 2 bytes are read from a router alert
option's data field.

Test: header.TestIPv6OptionsExtHdrIterErr
PiperOrigin-RevId: 347727876
2020-12-15 17:45:22 -08:00
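
The length check described above can be sketched as follows. This is an
illustration rather than the code from the commit: `parseRouterAlert` and
`errBadLen` are hypothetical names. Per RFC 2711, the option data is a 2-octet
value in network byte order.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
)

// routerAlertDataLen is the only data length RFC 2711 allows.
const routerAlertDataLen = 2

var errBadLen = errors.New("router alert option: data length must be 2")

// parseRouterAlert validates the data length before reading the value, so
// that exactly 2 bytes are consumed from the option's data field.
func parseRouterAlert(data []byte) (uint16, error) {
	if len(data) != routerAlertDataLen {
		return 0, errBadLen
	}
	return binary.BigEndian.Uint16(data), nil
}

func main() {
	v, err := parseRouterAlert([]byte{0x00, 0x01})
	fmt.Println(v, err)

	_, err = parseRouterAlert([]byte{0x00, 0x01, 0x02})
	fmt.Println(err)
}
```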
Andrei Vagin 97406b20a1 Internal change.
PiperOrigin-RevId: 347720083
2020-12-15 16:51:35 -08:00
Ghanan Gowripalan 50c658a9f6 Don't split enabled flag across multicast group state
Startblock:
  has LGTM from asfez
  and then
  add reviewer brunodalbo
PiperOrigin-RevId: 347716242
2020-12-15 16:28:53 -08:00
Jing Chen 1e56a2f9a2 Implement command SEM_INFO and SEM_STAT for semctl.
PiperOrigin-RevId: 347711998
2020-12-15 16:06:06 -08:00
Chong Cai 7aa674eb68 Change violation mode to an enum
PiperOrigin-RevId: 347706953
2020-12-15 15:40:04 -08:00
Ayush Ranjan f6407de6ba [syzkaller] Prevent AIOContext from resurrecting after being marked dead.
syzkaller reported the closing of a nil channel. This is only possible when the
AIOContext was destroyed twice.

Some scenarios that could lead to this:
- It died and then someone called aioCtx.Prepare() on it and then killed it again
  which could cause the double destroy. The context could have been destroyed
  in between the call to LookupAIOContext() and Prepare().
- aioManager was destroyed but it did not update the contexts map. So
  Lookup could still return a dead AIOContext and then someone could call
  Prepare on it and kill it again.

So added a check in aioCtx.Prepare() for the context being dead. This will
prevent a dead context from resurrecting.
Also refactored code to destroy the aioContext consistently. Earlier we were not
munmapping the aioContexts that were destroyed upon aioManager destruction.

Reported-by: syzbot+ef6a588d0ce6059991d2@syzkaller.appspotmail.com
PiperOrigin-RevId: 347704347
2020-12-15 15:27:35 -08:00
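
The dead-context check described above might be sketched like this. The
`aioContext` type and its fields are hypothetical simplifications, not the
sentry code. The sketch shows two guards: `prepare` refuses to operate on a
dead context, and `destroy` closes the channel at most once.

```go
package main

import (
	"fmt"
	"sync"
)

// aioContext is a hypothetical, simplified AIO context.
type aioContext struct {
	mu          sync.Mutex
	dead        bool
	outstanding int
	done        chan struct{}
}

func newAIOContext() *aioContext {
	return &aioContext{done: make(chan struct{})}
}

// prepare reserves a slot for a request. It fails on a dead context, which
// keeps a destroyed context from being brought back into use.
func (c *aioContext) prepare() bool {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.dead {
		return false
	}
	c.outstanding++
	return true
}

// destroy marks the context dead and closes its channel exactly once.
func (c *aioContext) destroy() {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.dead {
		return
	}
	c.dead = true
	close(c.done)
}

func main() {
	c := newAIOContext()
	fmt.Println(c.prepare())
	c.destroy()
	c.destroy() // Safe: the second call is a no-op.
	fmt.Println(c.prepare())
}
```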
Ayush Ranjan cc28d36845 [netstack] Make recvmsg(2) call to host in hostinet even if dst is empty.
We want to make the recvmsg syscall to the host regardless of whether the dst
is empty so that:
- Host can populate the control messages if necessary.
- Host can return sender address.
- Host can return appropriate errors.

Earlier because we were using the IOSequence.CopyOutFrom() API, the usermem
package does not even call the Reader function if the destination is empty (as
an optimization).

PiperOrigin-RevId: 347684566
2020-12-15 13:48:33 -08:00
gVisor bot e605144938 Internal change.
PiperOrigin-RevId: 347671070
2020-12-15 12:41:02 -08:00
Fabricio Voznika 12ac31ed04 fsgofer optimizations
- Skip chown call in case owner change is not needed
- Skip filepath.Clean() calls when joining paths
- Pass unix.Stat_t by value to reduce runtime.duffcopy calls.
  This change allows for better inlining in localFile.walk().

                              Change         Baseline       Improvement
BenchmarkWalkOne-6             2912 ns/op     3082 ns/op     5.5%
BenchmarkCreate-6             15915 ns/op    19126 ns/op    16.8%
BenchmarkCreateDiffOwner-6    18795 ns/op    19741 ns/op     4.8%

PiperOrigin-RevId: 347667833
2020-12-15 12:23:55 -08:00
gVisor bot 5843a5007c Merge pull request #4722 from zhlhahaha:2010
PiperOrigin-RevId: 347660920
2020-12-15 11:54:04 -08:00
Nayana Bidari b15acae9a6 Fix error code for connect in raw sockets.
PiperOrigin-RevId: 347650354
2020-12-15 11:07:48 -08:00
Adin Scannell 4e963c99ce Cleanup GitHub actions workflows.
Also, drop the pull_request template, since this has not proved to be
helpful, and just results in a commit message that includes the list.

PiperOrigin-RevId: 347636507
2020-12-15 10:05:47 -08:00